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NOVEL ANTIGENS of HELICOBACTER PYLORI 

Field of the Invention 

5 The present invention relates to novel nucleic acids and polypeptides 

relating to Helicobacter pylori The nucleic acid sequences and polypeptides are useful 
for diagnostic and therapeutic purposes. 

Background of the Invention 

1 0 Helicobacter pylori (H pylori) is a gram-negative, S-shaped, 

microaerophilic bacterium that was discovered and cultured from a human gastric 
biopsy specimen infection. [Warren, J.R. et al., Lancet, 1: 1273-1275, (1983) ; and 
Marshall et al., Microbios Lett, 25: 83-88, (1984)]. H pylori has been strongly linked to 
chronic gastritis and duodenal ulcer disease. [Rathbone et al., Gut, 27: 635-641, 

1 5 (1986)]. Additional evidence has developed for an etiological role of H pylori in non- 
ulcer dyspepsia, gastric ulcer disease, and gastric adenocarcinoma. [Blaser, M.J., 
Trends Microbiol, 1: 255-260, (1993)]. H. pylori colonizes the human gastric mucosa, 
establishing an infection that usually persists for many years. About 30-50% of the 
human population appear to be chronically infected. [Rainer, H. et al, Biologicals, 25: 

20 175-177, (1997)]. The current recommended treatment for chronic H pylori infection is 
multiple antibiotic treatment combined with a proton pump inhibitor, or with bismuth 
salts. However, this treatment may not fully resolve the infection and resistance to 
antibiotics can occur. H. pylori infection in humans induces a strong local and systemic 
immune response; however, the immune response if often unable to clear the infection. 

25 Accordingly, work has led to developing potential H pylori vaccines. Various antigens, 
proteins and genes have been reported in this area. [See Bolin et al., PCT Application 
No. 96/38475, filed June 3, 1996; Doidge et al., PCT Application No. 96/33220, filed 
April 19, 1996; Clancy et al., PCT Application No. 96/25430, filed February 15, 1996; 
Chan et al., PCT Application No. 96/12965, filed October 19, 1995; Byrne et al., PCT 

30 Application No. 96/01273, filed July 3, 1995; Alemohammad M.M., in U.S. Patent 
5,262,156, filed August 12, 1991; Allan et al., PCT Application No. 97/03359, filed 
June 28, 1996; and Dettmar et al., German Patent Application 195235554 Al based on 
Great Britain Application No. 94 1 3074.] As an alternative to potential vaccines and 
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diagnostic currently provided in the art referenced above, novel methods of curing or 
preventing H pylori infection through the use of novel K pylori proteins, as well as 
genes encoding such proteins are described herein. Proteins were selected for use as an 
antigen and/or vaccine candidate based on the following criteria: (i) the antigen is 
5 located on the bacterial surface, (ii) the antigen is conserved among H. pylori clinical 
isolates, (iii) the antigen elicits functional antibodies, (iv) the antigen is able to confer 
protection to vaccinated mice from challenge with a live organism. 

Summary of the Invention 

1 0 This invention relates to novel H. pylori bacterial surface proteins and nucleic 

acid sequences encoding therefor, in particular novel H pylori bacterial surface proteins 
having molecular weights of approximately 75, 77, and 79 kilo daltons (kDa). The 
mature processed forms of these proteins share a common amino-terminal amino acid 
sequence. The proteins and nucleic acid sequences of the present invention have 

1 5 diagnostic and therapeutic utility for H. pylori and other Helicobacter species. They can 
be used to detect the presence of H pylori and other Helicobacter species in a sample, 
and to screen compounds for the ability to interfere with the H pylori life cycle or to 
inhibit H. pylori infection. More specifically, this invention includes embodiments 
relating to isolated nucleic acid sequences corresponding to the entire coding sequences 

20 of H. pylori surface proteins or portions thereof, nucleic acids capable of binding mRNA 
from H. pylori surface proteins and methods for producing H. pylori surface proteins or 
portions thereof using peptide synthesis and recombinant techniques. Additional 
embodiments are also directed to antigenic and vaccine compositions based on agents 
prepared from the proteins and nucleic acids of this invention and methods for treatment 

25 and prevention of H pylori infections employing such compositions. 



Brief Description of the Drawings 

Figure 1 : Figure 1(A) depicts an SDS-PAGE gel with bands showing the 75kDa and 
30 77kDa proteins in lane 3 in comparison to Zwittergent™ 3-14 crude extract of H pylori 
outer membrane proteins in lane 2 and molecular weight standards in lane 1 . Figure 
1(B) depicts a Western blot of monoclonal antibody 64-27 with the co-purified 75kDa 
and 77kDa proteins from ATCC 43579, as described in Example 1. Lane 1 is the 
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molecular weight markers, lane 2 is the crude extract and lane 3 is the co-purified 
proteins. 

Figure 2 depicts an electron micrograph of the surface labeling of K pylori strain PBCC 
5 1 105 with mouse polyclonal antisera to a co-purified mixture of 1 Sill kDa proteins, as 
described in Example 2. 

Figure 3 depicts two graphs of Flow Cytometry analysis of strain PBCC 1 105 with 
labeled an anti-75/77 mouse polyclonal antibody, at day 0 (Figure 3 A) and day 49 
1 0 (Figure 3B) following injection of mice with a mixture of 75kDa and 77kDa proteins, as 
described in Example 2. Profilel is ATCC 43579 (homologous strain); profile 2 is strain 
PBCC 1 105; profile 3 is strain ATCC 43504; profile 4 is strain SS-1 and profile 5 is a 
urease-negative strain. 

1 5 Figure 4 is a graph depicting the bactericidal activity of anti-75/77kDa polyclonal 

mouse sera, as described in Example 3, with either incomplete Freund's adjuvant or 
MPL™. 

Figure 5: This Figure depicts the mouse protection data (in colony forming units) from 
20 the K pylori SS 1 experimental challenge following vaccination of mice with a mixture 
of the co-purified 75kDa/77kDa proteins, as described in Example 4. 

Figure 6: This Figure depicts a DNA (SEQ ID NO 19)for the 75kDa gene from strain 
ATCC 43579. 

25 

Figure 7: This Figure depicts the predicted translated protein sequence for the DNA 
sequence (SEQ ID NO 21)for the 75kDa gene from strain ATCC 43579 in figure 6. 

Figure 8: This Figure depicts a DNA sequence (SEQ ID NO 20)for the 79kDa gene 
30 from strain ATCC 43579. 

Figure 9: This Figure depicts the predicted translated protein sequence (SEQ ID NO 
22)for the DNA sequence for the 79kDa gene from strain ATCC 43579 in figure 8. 
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Figure 10: This Figure depicts SDS-PAGE gel bands from comparison expression 
experiments from Example 13 for recombinant 75kDa protein expressed from a low 
copy number plasm id_pBAD24 and the high copy number T7 expression plasmid 
5 pRSETb. The bands show the increased amounts of protein 75 kDa expressed. 

Figure 11: This Figure depicts SDS-PAGE gel bands from comparison expression 
experiments from Example 13 for recombinant 77kDa protein expressed from a low 
copy number plasmid j>ETl 7 and the high copy number T7 expression plasmid pRSETb. 
1 0 The bands show the increased amounts of protein 77 kDa expressed. 

Figure 12: This Figure depicts SDS-PAGE gel bands from comparison expression 
experiments from Example 13 for recombinant 79kDa protein expressed from a low 
copy number plasmid pBAD24 and the high copy number T7 expression plasmid 
1 5 pRSETb. The bands show the increased amounts of protein 79 kDa expressed. 

Figure 13: This Figure depicts mouse protection data (in colony forming units) from the 
K pylori SS 1 experimental challenge following vaccination of mice with a mixture of 
the co-purified 75kDa/77kDa proteins 

20 

Figure 13: This Figure depicts therapeutic effect (in colony forming units) of a mixture 
of the co-purified 75kDa/77kDa proteins when vaccinating mice after infection with K 
pylori SSI Experiments included intragastric vaccination and a subcutaneous 
vaccination. 

25 

Detailed Description of the Invention 

One aspect of the present invention provides an isolated, substantially purified 
H, pylori polypeptide selected from the group consisting of (i) a polypeptide having a 
molecular weight of about 75 kDa; (ii) a polypeptide having a molecular weight of about 
30 77 kDa; and (iii) a polypeptide having a molecular weight of about 79 kDa; wherein the 
mature processed form of each polypeptide has a starting sequence consisting essentially 
of EDDGFYTSVG YQIGEAAQMV (SEQ. ID NO.7). The present invention also relates 
to isolated polypeptides. Preferred embodiments of the invention relate to an isolated 
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polypeptide of K pylori selected from the group consisting of (i) a polypeptide having a 
molecular weight of about 75 kDa and having the amino acid sequence of SEQ. ID 
NO.l or SEQ ID NO. 19; (ii) a polypeptide having a molecular weight of about 77 kDa 
and having the amino acid sequence of SEQ. ID NO.2; and (iii) a polypeptide having a 
5 molecular weight of about 79 kDa and having the amino acid sequence of SEQ. ID 

NO.3 or SEQ ID NO. 20. Preferably, the polypeptides of this invention have antigenic 
properties, such as being reactive with H. pylori antibodies. Antigens can be based on 
the isolated polypeptides sequences, or allelic or other variants thereof, which are 
biological equivalents. Suitable biological equivalents have about 70 to about 80%, and 
1 0 most preferably at least about 90%, similarity to one of the amino acid sequences 

referred to above, or to a portion thereof, provided the equivalent is capable of eliciting 
substantially the same antigenic properties as the isolated polypeptide sequences 
specified hereinabove. 

1 5 The biological equivalents are obtained by generating variants and 

modifications to the isolated polypeptides of this invention. These variants and 
modifications to the isolated polypeptides are obtained by altering the amino acid 
sequences by insertion, deletion or substitution of one or more amino acids. The 
polypeptides are then selected for use as an antigen and/or vaccine candidate based on 

20 the following criteria: (i) the antigen is located on the bacterial surface, (ii) the antigen 
is conserved among H. pylori clinical isolates, (iii) the antigen elicits functional 
antibodies, (iv) the antigen is able to confer protection to vaccinated mice from 
challenge with a live organism. Modifying the amino acid, for example by substitution, 
the amino acids of the protein to create a polypeptide having substantially the same or 

25 improved qualities. The amino acid changes are achieved by changing the codons of the 
nucleic acid sequence. It is known that such polypeptides can be obtained based on 
substituting certain amino acids for other amino acids in the polypeptide structure in 
order to modify or improve antigenic or immunogenic activity (see, e.g. Kyte and 
Doolittle, 1982, Hopp, US Patent 4,554,101, each incorporated herein by reference). 

30 For example ,through substitution of alternative amino acids, small conformational 
changes may be conferred upon a polypeptide which result in increased activity or 
enhanced immune response. Alternatively, amino acid substitutions in certain 
polypeptides may be utilized to provide residues which may then be linked to other 
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molecules to provide peptide-molecule conjugates which retain sufficient antigenic 
properties of the starting polypeptide to be useful for other purposes. For example, a 
selected polypeptide of the present invention may be bound to a solid support in order to 
have particular advantages for diagnostic applications. 

5 

One can use the hydropathic index of amino acids in conferring interactive 
biological function on a polypeptide, as discussed by Kyte and Doolittle (1982), wherein 
it is found that certain amino acids may be substituted for other amino acids having 
similar hydropathic indices and still retain a similar biological activity. Alternatively, 

1 0 substitution of like amino acids may be made on the basis of hydrophilicity, particularly 
where the biological function desired in the polypeptide to be generated is intended for 
use in immunological embodiments. US Patent 4,554,101, which states that the greatest 
local average hydrophilicity of a "protein," as governed by the hydrophilicity of its 
adjacent amino acids, correlates with its immunogenicity. Accordingly, it is noted that 

1 5 substitutions can be made based on the hydrophilicity assigned to each amino acid. 

In using either the hydrophilicity index or hydropathic index, which assigns 
values to each amino acid, it is preferred to conduct substitutions of amino acids where 
these values are +2, with +1 being particularly preferred, and those within + 0.5 being 
20 the. most preferred substitutions. 

Preferable characteristics of the polypeptides of this invention include one or 
more of the following: (a) being a membrane protein or being a protein directly 
associated with a membrane; (b) capable of being separated as a protein using an SDS 
25 acrylamide (10%) gel; (c) generating antibody which exhibits bactericidal activity upon 
injection in a mouse; and/or (d) reducing the colonization of K pylori in mice upon 
delivery thereto. 

The isolated polypeptides having molecular weights of about 75, 77, and 79 kDa 
30 are particular useful in antigenic compositions for several reasons. First, using 
monoclonal antibodies derived by injecting mice with whole bacteria, these 
polypeptides were identified as closely related outer membrane proteins of H. pylori. 
Second, our analyses by a number of in vitro techniques indicate that the polypeptides 
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are surface localized and that they are expressed in a variety of clinically relevant 
strains. Third, the polyclonal antibodies generated by injecting mice with a mixture of 
the purified polypeptides are bactericidal. Fourth, we observed that following oral 
vaccination with the polypeptides admixed with cholera toxin (CT), 100% of mice 
5 challenged with H. pylori strain SSI showed a statistically significant reduction in 
colonization as compared to non-vaccinated control mice. 

Accordingly, further embodiments of this invention relate to an antigenic 
composition comprising (i) at least one isolated polypeptide as disclosed above and (ii) 

10 a pharmaceutically acceptable buffer, diluent, adjuvant or carrier. The antigenic 
composition may comprise a carrier, which in turn may be conjugated to said 
polypeptide. In additional embodiments, the antigenic composition may further 
comprise an adjuvant. Preferably, these compositions have therapeutic and prophylatic 
applications as vaccines in preventing and/or ameliorating H. pylori infection. In such 

1 5 applications immunologically effective amount (as discussed herein) of at least one 
polypeptide of this invention is employed. 

The formulation of such prophylactic or therapeutic antigenic compositions is 
well known to persons skilled in this field. Antigenic compositions of the invention 

20 containing antigenic components (e.g., H. pylori polypeptide or fragment thereof or 

nucleic acid encoding an K pylori polypeptide or fragment thereof) preferably include a 
pharmaceutically acceptable carrier. Suitable pharmaceutically acceptable carriers 
and/or diluents include any and all conventional solvents, dispersion media, fillers, solid 
carriers, aqueous solutions, coatings, antibacterial and antifungal agents, isotonic and 

25 absorption delaying agents, and the like. The term "pharmaceutically acceptable 

carrier" refers to a carrier that does not cause an allergic reaction or other untoward 
effect in patients to whom it is administered. Suitable pharmaceutically acceptable 
carriers include, for example, one or more of water, saline, phosphate buffered saline, 
dextrose, glycerol, ethanol and the like, as well as combinations thereof. 

30 Pharmaceutically acceptable carriers may further comprise minor amounts of auxiliary 
substances such as wetting or emulsifying agents, preservatives or buffers, which 
enhance the shelf life or effectiveness of the antibody. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar 
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as any conventional media or agent is incompatible with the active ingredient, use 
thereof in the antigenic compositions of the present invention is contemplated. 

Such antigenic compositions are conventionally administered parenterally, e.g., 
5 by injection, either subcutaneously or intramuscularly. Methods for intramuscular 
immunization are described by Wolff et al. and by Sedegah et al. Other modes of 
administration include oral and pulmonary formulations, suppositories, and transdermal 
applications. Oral immunization is preferred over parenteral methods for inducing 
protection against infection by H. pylori (See Czinn et al.). Oral formulations include 
1 0 such normally employed excipients as, for example, pharmaceutical grades of mannitol, 
lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 
carbonate, and the like. 

The antigenic compositions of the invention can include an adjuvant, including, 
1 5 but not limited to aluminum hydroxide; aluminum phosphate; Stimulon™ QS-21 
(Aquila Biopharmaceuticals, Inc., Worcester, MA); MPL™ (3-O-deacylated 
monophosphoryl lipid A; RIBI ImmunoChem Research, Hamilton, MT), IL-12 
(Genetics Institute, Cambridge, MA) N-acetyl-muramyI~L-theronyl-D-isoglutamine 
(thr-MDP); N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as 
20 nor-MDP); N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2-dipalmitoyl- 
sn-gIycero-3-hydroxyphos-phoryloxy--ethylamine (CGP 19835 A, referred to a MTP- 
PE); and cholera toxin. Others which may be used are non-toxic derivatives of cholera 
toxin, including its B subunit, and/or conjugates or genetically engineered fusions of the 
H. pylori polypeptide with cholera toxin or its B subunit, procholeragenoid, fungal 
25 polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide 

derivatives, phorbol esters, labile toxin of E. coli, non-K pylori bacterial lysates, block 
polymers or saponins. 

Preferably also, this antigenic composition or an isolated polypeptide of this 
30 invention is used in a vaccine composition for oral administration which includes a 
mucosal adjuvant. 
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In a particularly preferred aspect of this invention, an oral vaccine composition 
comprising an antigenic composition in association with a mucosal adjuvant, is used for 
the treatment or prevention of H. pylori infection in a human host. 



5 The mucosal adjuvant can be cholera toxin; however, preferably, mucosal 

adjuvants other than cholera toxin which may be used in accordance with the present 
invention include non-toxic derivatives of cholera toxin, such as the B sub-unit (CTB), 
chemically modified cholera toxin, or related proteins produced by modification of the 
cholera toxin amino acid sequence. These may be added to, or conjugated with, the 

1 0 Helicobacter antigenic composition. The same techniques can be applied to other 

molecules with mucosal adjuvant or delivery properties such as Escherichia coli heat 
labile toxin (LT). Other compounds with mucosal adjuvant or delivery activity may be 
used such as bile; polycations such as DEAE-dextran and polyornithine; detergents such 
as sodium dodecyl benzene sulphate; lipid-conjugated materials; antibiotics such as 

1 5 streptomycin; vitamin A; and other compounds that alter the structural or functional 

integrity of mucosal surfaces. Other mucosally active compounds include derivatives of 
microbial structures such as MDP; acridine and cimetidine. QS-21, MPL™, and IL-12, 
which described above, may also be used 

20 The Helicobacter antigenic composition of this invention may be delivered in 

the form of ISCOMS (immune stimulating complexes), ISCOMS containing CTB, 
liposomes or encapsulated in compounds such as acrylates or poly(DL-lactide-co- 
glycoside) to form microspheres of a size suited to adsorption by M cells. Alternatively, 
micro or nanoparticles may be covalently attached to molecules such as vitamin B12 

25 which have specific gut receptors. The Helicobacter isolated polypeptides of this 
invention may also be incorporated into oily emulsions. 



The Helicobacter isolated polypeptides of the present invention may be 
administered as the sole active immunogen in an antigenic composition. Alternatively, 
30 however, the antigenic, or vaccine, composition may include other active immunogens, 
including other Helicobacter antigens such as urease, lipopolysaccharide, Hsp60, VacA, 
CagA or catalase, as well as immunologically active antigens against other pathogenic 
species. 



9 



WO 00/00614 



PCT/US99/14375 



One of the important aspects of this invention relates to a method of inducing 
immune responses in a mammal comprising the step of providing to said mammal an 
antigenic composition of this invention. Preferred embodiments relate to a method for 
5 the treatment or prevention of Helicobacter infection in a human comprising 
administering to a human an immunologically effective amount of an antigenic 
composition. Immunologically effective amount, as used herein, means the 
administration of that amount to a mammalian host, either in a single dose or as part of a 
series of doses, sufficient to at least cause the immune system of the individual treated 

10 to generate a response that reduces the clinical impact of the bacterial infection. This 
may range from a minimal decrease in bacterial burden to prevention of the infection. 
Ideally, the treated individual will not exhibit the more serious clinical manifestations of 
the Helicobacter infection. The dosage amount can vary depending upon specific 
conditions of the individual. This amount can be determined in routine trials by means 

15 known to those skilled in the art. 



Another specific aspect of the present invention relates to using a vaccine vector 
expressing an isolated Helicobacter polypeptide, or an immunogenic fragment thereof 
Accordingly, in a further aspect this invention provides a method of inducing an 

20 immune response in a mammal, which comprises providing to a mammal a vaccine 

vector expressing at least one, or a mixture of isolated Helicobacter polypeptides of this 
invention, or an immunogenic fragment thereof. The isolated polypeptides of the 
present invention can be delivered to the mammal using a live vaccine vector, in 
particular using live recombinant bacteria, viruses or other live agents, containing the 

25 genetic material necessary for the expression of the an antigenic polypeptide or 

immunogenic fragment as a foreign polypeptide. Particularly, bacteria that colonize the 
gastrointestinal tract, such as Salmonella, Shigella, Yersinia, Vibrio, Escherichia and 
BCG have been developed as vaccine vectors, and these and other examples are 
discussed by Holmgren et al. (1992) and McGhee et al. (1992). 

30 

An additional embodiment of the present invention relates to a method of 
inducing an immune response in a human comprising administering to said human an 
amount of a DNA molecule encoding an isolated polypeptide of this invention, 
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optionally with a transfection-facilitating agent, where said polypeptide retains 
immunogenicity and, when incorporated into an antigenic composition or vaccine and 
administered to a human, provides protection without inducing enhanced disease upon 
subsequent infection of the human with Helicobacter pathogen, such as H pylori. 
5 Transfection-facilitating agents are known in the art. 

The present invention also relates to an antibody, which may either be a 
monoclonal or polyclonal antibody, specific for antigenic polypeptides as described 
above. Such antibodies may be produced by methods which are well known to those 
1 0 skilled in the art. The antibodies of this invention can be employed in a method for the 
treatment or prevention of Helicobacter infection in mammalian hosts, which comprises 
administration of an immunologically effective amount of antibody, specific for 
antigenic polypeptide as described above. 

15 It is proposed that the monoclonal antibodies of the present invention will find 

useful application in standard immunochemical procedures, such as ELISA and western 
blot methods, as well as other procedures which may utilize antibodies specific to H. 
pylori proteins. While ELISAs are preferred, it will be readily appreciated that such 
assays include RIAs and other non-enzyme linked antibody binding assays or 

20 procedures. Additionally, it is proposed that monoclonal antibodies specific to the 

particular H pylori protein or polypeptides may be utilized in other useful applications. 
For example, their use in immunoadsorbent protocols may be useful in purifying native 
or recombinant H pylori proteins or variants thereof. 

25 It also is proposed that the disclosed H. pylori polypeptides of the invention will 

find use as antigens for raising antibodies and in immunoassays for the detection of anti- 
75/77 kDa antigen-reactive antibodies. In a variation on this embodiment, samples 
suspected of containing H. pylori may be screened, in immunoassay format, for 
reactivity against antibodies specific for 75, 77 and 79 kDa polypeptides of this 

30 invention. Results from such analyses may then be used to determine the presence of H 
pylori and potential infection. 
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Diagnostic immunoassays include direct culturing of bodily fluids or tissue, 
either in liquid culture or on a solid support such as nutrient agar. A typical assay 
involves collecting a sample of bodily fluid from a patient and placing the sample under 
conditions optimum for growth of the pathogen. The determination can then be made as 
5 to whether the microbe exists in the sample. Further analysis can be carried out to 
determine the hemolyzing properties of the microbe. 



Immunoassays encompassed by the present invention include, but are not 
limited to those described in U.S. Patent No. 4,367,1 10 (double monoclonal antibody 
1 0 sandwich assay) and U.S. Patent No. 4,452,901 (western blot), which U.S. Patents are 
incorporated herein by reference. Other assays include immunoprecipitation of labeled 
ligands and immunocytochemistry, both in vitro and in vivo. 

Immunoassays, in their most simple and direct sense, are binding assays. 
1 5 Certain preferred immunoassays are the various types of enzyme linked immunosorbent 
assays (ELISAs) and radioimmunoassays (RIAs) known in the art. 
Immunohistochemical detection using tissue sections is also particularly useful. 
However, it will be readily appreciated that detection is not limited to such techniques, 
and western blotting, dot blotting, FACS analyses, and the like may also be used. 

20 

In one exemplary ELISA, the anti-75/77 kDa antibodies of the invention are 
immobilized onto a selected surface exhibiting protein affinity, such as a well in a 
polystyrene microtiter plate. Then, a test composition suspected of containing the 
desired antigen, such as a clinical sample, is added to the wells. After binding and 

25 washing to remove non- specifically bound immune complexes, the bound antigen may 
be detected. Detection is generally achieved by the addition of another antibody, 
specific for the desired antigen, that is linked to a detectable label. This type of ELISA 
is a simple "sandwich ELISA." Detection may also be achieved by the addition of a 
second antibody specific for the desired antigen, followed by the addition of a third 

30 antibody that has binding affinity for the second antibody, with the third antibody being 
linked to a detectable label. 
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In another exemplary ELISA, the samples suspected of containing an H. pylori 
polypeptide are immobilized onto the well surface and then contacted with the anti- 
75/77 kDa antibodies. After binding and appropriate washing, the bound immune 
complexes are detected. Where the initial antigen specific antibodies are linked to a 
5 detectable label, the immune complexes may be detected directly. Again, the immune 
complexes may be detected using a second antibody that has binding affinity for the first 
antigen specific antibody, with the second antibody being linked to a detectable label. 



Further methods include the detection of primary immune complexes by a two 
1 0 step approach. A second binding ligand, such as an antibody, that has binding affinity 
for the primary antibody is used to form secondary immune complexes, as described 
above. After washing, the secondary immune complexes are contacted with a third 
binding ligand or antibody that has binding affinity for the second antibody, again under 
conditions effective and for a period of time sufficient to allow the formation of immune 
1 5 complexes (tertiary immune complexes). The third ligand or antibody is linked to a 

detectable label, allowing detection of the tertiary immune complexes thus formed. This 
system may provide for signal amplification if desired. 

Competition ELISAs are also possible in which test samples compete for 
20 binding with known amounts of labeled antigens or antibodies. The amount of reactive 
species in the unknown sample is determined by mixing the sample with the known 
labeled species before or during incubation with coated wells. (Antigen or antibodies 
may also be linked to a solid support, such as in the form of beads, dipstick, membrane 
or column matrix, and the sample to be analyzed applied to the immobilized antigen or 
25 antibody). The presence of reactive species in the sample acts to reduce the amount of 
labeled species available for binding to the well and thus reduces the ultimate signal. 



Irrespective of the format employed, ELISAs have certain features in common, 
such as coating, incubating or binding, washing to remove non-specifically bound 
30 species, and detecting the bound immune complexes. These are described below. 



In coating a plate with either antigen or antibody, one will generally incubate 
the wells of the plate with a solution of the antigen or antibody, either overnight or for a 
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specified period. The wells of the plate will then be washed to remove incompletely 
absorbed material. Any remaining available surfaces of the wells are then "coated" with 
a nonspecific protein that is antigenically neutral with regard to the test antisera. These 
include bovine serum albumin (BSA), casein and solutions of milk powder. The coating 
5 allows for blocking of nonspecific adsorption sites on the immobilizing surface and thus 
reduces the background caused by nonspecific binding of antisera onto the surface. 

After binding of antigenic material to the well, coating with a non-reactive 
material to reduce background, and washing to remove unbound material, the 

1 0 immobilizing surface is contacted with the antisera or clinical or biological extract to be 
tested in a manner conducive to immune complex (antigen/antibody) formation. Such 
conditions preferably include diluting the antisera with diluents such as BSA, bovine 
gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween. These added 
agents also tend to assist in the reduction of nonspecific background. The layered 

1 5 antisera is then allowed to incubate for from 2 to 4 hours, at temperatures preferably on 
the order of 25° to 27°C. Following incubation, the ant i sera-contacted surface is 
washed so as to remove non-immunocomplexed material. A preferred washing 
procedure includes washing with a solution such as PBS/Tween, or borate buffer. 

20 Following formation of specific immunocomplexes between the test sample and 

the bound antigen, and subsequent washing, the occurrence and even amount of 
immunocomplex formation may be determined by subjecting same to a second antibody 
having specificity for the first. Of course, in that the test sample will typically be of 
human origin, the second antibody will preferably be an antibody having specificity in 

25 general for human IgG. To provide a detecting means, the second antibody will 

preferably have an associated enzyme that will generate a color development upon 
incubating with an appropriate chromogenic substrate. Thus, for example, one will 
desire to contact and incubate the ant i sera-bound surface with a urease or peroxidase- 
conjugated anti-human IgG for a period of time and under conditions which favor the 

30 development of immunocomplex formation (e.g., incubation for 2 hours at room 
temperature in a PBS-containing solution such as PBS-Tween). 
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After incubation with the second enzyme-tagged antibody, and subsequent to 
washing to remove unbound material, the amount of label is quantified by incubation 
with a chromogenic substrate such as urea and bromocresol purple or 2.2'-azino-di-(e- 
ethyl-benzthiazoline-6-sulfonic acid [ABTS] and H2O2, in the case of peroxidase as the 
5 enzyme label. Quantification is then achieved by measuring the degree of color 

generation, e.g. y using a visible spectra spectrophotometer. Alternatively, the label may 
be a chemilluminescent one. The use of such labels is described in U.S. Patent Nos. 
5,3 10,687, 5,238,808 and 5,221,605. 

1 0 Further embodiments relate to isolated nucleic acid sequences encoding the 

polypeptides of this invention, in particular the nucleic acid sequences as set forth in 
SEQ. ID NO.4, 5, and 6, or being substantially similar to all or a portion thereof. The 
term substantially similar means having a least 50-70%, more preferably 70-80%, and 
most preferably 80 or 90% identity to one of said sequences. Such substantially similar 

1 5 nucleic acid sequences hybridize under high stringency southern hybridization 

conditions with at least one of the nucleic acid sequences set forth in SEQ ID Nos. 4, 5 
or 6. 



The nucleic acid molecule may be RNA or DNA, single stranded or double 
20 stranded, in linear or covalently closed circular form. For the purposes of defining high 
stringency southern hybridization conditions , reference can conveniently be made to 
Sambrook et al..l989, Book 2, pp 9.31-9.58. 

It will be appreciated that the sequence of nucleotides of this aspect of the 
25 invention may be obtained from natural, synthetic or semi-synthetic sources; 

furthermore, this nucleotide sequence may be a naturally occurring sequence, or it may 
be related by mutation, including single or multiple base substitutions, deletions, 
insertions and inversions, to such a naturally occurring sequence, provided always that 
the nucleic acid molecule comprising such a sequence is capable of being expressed as a 
30 Helicobacter antigen as broadly described above. 

The nucleotide sequence may have expression control sequences positioned 
adjacent to it, such control sequences usually being derived from a heterologous source. 
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This invention also provides a recombinant DN A molecule comprising an 
expression control sequence having promoter sequences and initiator sequences and a 
nucleotide sequence which codes for a Helicobacter antigen, the nucleotide sequence 
5 being located 3' to the promoter and initiator sequences. In yet another aspect, the 
invention provides a recombinant DNA cloning vehicle capable of expressing a 
Helicobacter antigen comprising an expression control sequence having promoter 
sequences and initiator sequences, and a nucleotide sequence which codes for a 
Helicobacter antigen, the nucleotide sequence being located 3' to the promoter and 
1 0 initiator sequences. Cloning vehicles can be any plasmid (or vector) known in the art, 
including viral vectors, such as alphavirus pox viruses. In a further aspect, there is 
provided a host cell containing a recombinant DNA cloning vehicle and/or a 
recombinant DNA molecule as described above. 

1 5 Suitable expression control sequences and host cell/cloning vehicle 

combinations are well known in the art, and are described by way of example, in 
Sambrook et al. (1989). One embodiment of this invention relates to expression systems 
that employ the use of plasmids. Preferred embodiments of the invention employ 
plasmids which exhibit high copy number upon replication in a transformed host cell. 

20 Copy number refers to number of copies of the plasmid, or the genes contained therein, 
which replicate in a host cell upon induction of the plasmid. The copy number within a 
cell determines the copies of the desired nucleotide sequence (or gene). This copy 
number equates to the gene dosage. Certain plasmids are characterized as high copy 
number. Generally, high copy number refers to a plasmid which generates at least about 

25 100 copies per cell and preferably generating from about 100 to about 700 or 1,000 

copies per cell. The selection of the appropriate copy number for the expression of the 
nucleotides sequences of this invention when combined with the selection of a strong 
promoter improves the amount of desired polypeptide that is generated in a host cell. In 
each high copy number plasmid, the nucleic acid sequence encoding a selected 

30 polypeptide is operably linked to a strong promoter (Hannig et al.). Plasmid copy 
number is determined by the nature the origin of replication of the plasmid and 
corresponding cis acting control elements, together these genetic elements are defined as 
a replicon. In plasmids which normally reside in E. coli, there are several different 
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replicons. Most common is pMBl and its close relative, ColEl . Plasmids which carry 
these replicons maintain about 15-20 copies per cell under normal growth conditions. A 
small protein encoded by the rop gene adjacent to the origin of replication negatively 
regulates pMBl/ColEl plasmid replication. Plasmids with deletions in the rop gene or 
5 mutations in the site of rop action have increased replication and higher copy number. 
One example of this is in the pUC plasmids whose copy number is increased to 500-700 
copies per cell. Other lower copy number replicons are pi 5 A (10-12 copies/cell), and 
pSClOl (5-10 copies per cell). Also, the moderate copy number of pMBl and ColEl 
plasmids can be modified to high copy numbers by inhibiting the E. coli protein 
1 0 synthesis. 

It is further noted that the copy number exhibited in a specific host is the proper 
measurement of plasmid copy number. Plasmids which normally do not reside in gram 
negative hosts or E. coli hosts may not be regulated for replication when introduced into 

1 5 E. coli hosts. When such promoters are not regulated, they can exhibit high copy 
number in the selected host. An example of this is pNG2, a plasmid from the gram 
positive bacteria Corynebacterium diphtherial Whereas, the normal copy number for 
this plasmid in C. dipththeriae is 1-2 copies per cell, transformants of the plasmid in E. 
coli are estimated to have >100 copies of the plasmid (Serwold-Davis, T.et aI.,PNAS, 

20 84:4964-8, 1987). 

Strong promoters are selected such they are easily regulated in order that they 
may be repressed during culture growth "towards maximal cell numbers. In many 
embodiments, the strong promoters can also be induced so that the host cell 

25 overproduces the recombinant polypeptide at a desired level, usually in excess of about 
10 to about 30% total cellular protein. The promoter is selected to maintain the desired 
level of polypeptide expression. Exemplary promoters are T7 promoter from T7 
bacteriophage, arabinose promoter for the araBAD operon, lambda phage promoters 
(such as P L and P R ) the trc and tac promoters. The T7 promoter is controlled by the T7 

30 RNA polymerase gene which is a very active enzyme; it elongates RNA chains five 
times faster than the E. coli RNA polymerase. In addition the polymerase is very 
selective for specific promoter sequences and termination signals so that the action of 
the enzyme is targeted specifically to the gene of interest. In one system developed by 
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Studier et al. (referenced, infra), the T7 polymerase gene has been integrated into the 
host chromosome under the control of the lac promoter. Upon induction with IPTG, T7 
RNA polymerase is produced and acts to transcribe a T7 promoter gene housed on an 
expression plasmid in the cell. In addition, a second plasmid expressing T7 lysozyme is 
5 included to control the background expression of the T7 polymerase. For T7, an 

alternative induction system for the expression of genes by the T7 promoter is to grow 
E. coli cells containing the T7 recombinant plasmid to the desired density and then 
introduce the T7 polymerase gene by infecting with a bacteriophage which carries that 
gene. 

10 

Expression can also be enhanced based on the choice of cellular 
compartmentalization. Outer membrane proteins are often difficult to overexpress 
recombinantly because of the requirement for transport to the outer membrane and 
correct insertion into that membrane. Overexpression of full length outer membrane 

1 5 proteins which contain the leader sequences will many times overcome the host cell 
export machinery leading to cessation of growth and low recombinant protein yield. 
Cloning the mature sequences of an outer membrane protein behind translation start 
signals provided by the expression vector can eliminate the need for the host cell to 
transport the protein to the membrane and allow the cell to overexpress the recombinant 

20 protein as inclusion bodies in the cytosol which are relatively stable and resistant to 
proteolysis. 

One or more of the above considerations are included when selecting a desired 
host. In general, one can select or optimize the host based on factors which can 

25 influence the yield of recombinantly expressed proteins. These factors include growth 
and induction conditions, mRNA stability, codon usage, translational efficiency and the 
presence of transcriptional terminators to minimize promoter read through. For 
example, one can modify the ability of the host to produce proteases. Stress induced 
proteases in the host cell are induced during induction of recombinant protein 

30 expression, therefore, the use of protease deficient hosts and/or coexpression of 
chaperones can be employed to minimize the proteolysis of the recombinant upon 
induction of the protein in culture. In the case at hand, the polypeptides of the invention 
contain cysteine bonds. Accordingly, one can aid cysteine bond formation, for example 
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through the use of trx hosts, which can help with the proper folding in the cytosol by 
altering the redox potential in the cytosol. 

Suitable host cell or host strains for the practice of this invention are omp Tlon 
5 host strains such as BL21 , BLR, B834; trxB host strains such as AD494, Ion clpA 

mutant strains such as KY 42263 and Ion clpA hslVU mutant strains such as KY 2266 
(for the latter two strains the Kanemori, M. et al. J. Bact. 1997. 79:7219-7225). 

When host cells, which are modified or selected for enhanced expression, are 

1 0 combined with the high copy number plasmids (also referred to as over- expressing 

plasmids) of this invention, the polypeptide is expressed as inclusion bodies. Preferably, 
the nucleotide sequence selected for forming inclusion bodies is the nucleotide sequence 
corresponding to the mature portion of the polypeptide. The signal sequence of the 
desired polypeptide is not included in this nucleotide sequence for the mature 

1 5 polypeptide. A desired H. pylori protein can be obtained by a method comprising: (a) 
transforming a selected host cell with at least one high copy number plasmid, which 
comprises the nucleotide sequence of interest operably linked to a strong promoter, and 
(b) growing the transformed host cell in culture media. Often, it is advantageous to use 
an inducible promoter to control the timing of the expression of the nucleotide sequence. 

20 Basically, when the promoter is inducible, the rate of transcription increases in response 
to the inducing agent or inducing conditions for the promoter. A selectable marker can 
used in the plasmid in order to grow the transformed host in the presence of a selecting 
agent that works in combination with the chosen marker. The K pylori polypeptide 
produced by the above method is expressed as inclusion bodies and is also soluble in 

25 detergent extractions, without the requiring denaturants, such as urea or guanidine for 
solubilizing the inclusion bodies. These two characteristics are important since they 
allow for the application of straightforward and practical means in isolating and 
purifying the polypeptide. In the absence of expressing the polypeptide as an inclusion 
body and as a soluble material, the difficulties in the purification steps are generally 

30 overwhelming. 
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In yet further aspects, there are provided fused polypeptides comprising a 
Helicobacter polypeptide of this invention and an additional polypeptide, for example a 
polypeptide coded for by the DNA of a cloning vehicle, fused thereto. Such a fused 
5 polypeptide can be produced by a host cell transformed or infected with a recombinant 
DNA cloning vehicle as described above and it can be subsequently isolated from the 
host cell to provide the fused polypeptide substantially free of other host cell proteins. 

Based on the above-identified specific sequences, one may obtain numerous 
1 0 additional isolated nucleic acid sequences encoding the polypeptides of this invention 
due to the degeneracy of the genetic code. Amino acids and their codons are well- 
known. Accordingly, using site-directed mutagenesis of one polypeptide of H pylori, 
one can generate additional nucleic acid sequences, as desired. These methods of 
generating nucleic acid sequences and fragments thereof provide a convenient manner in 
1 5 which to generate portions of the polypeptides for fusion molecules. 

Using the nucleic acid sequence described herein, one can generate synthetic 
polypeptides displaying the antigenicity of a Helicobacter isolated polypeptide of this 
invention. As used herein, the term "synthetic" means that the polypeptides have been 

20 produced by chemical or biological means, such as by means of chemical synthesis or 
by recombinant DNA techniques leading to biological synthesis. Such polypeptides 
can, of course, be obtained by cleavage of a fused polypeptide as described above and 
separation of the desired polypeptide from the additional polypeptide coded for by the 
DNA of the cloning vehicle by methods well known in the art. Alternatively, once the 

25 amino acid sequence of the desired polypeptide has been established, for example, by 
determination of the nucleotide sequence coding for the desired polypeptide, the 
polypeptide may be produced synthetically, for example by the well known Merrifield 
solid-phase synthesis procedure. 

30 Once recombinant DNA cloning vehicles and/or host cells expressing a desired 

Helicobacter polypeptide of this invention have been constructed by transforming or 
transfecting such cloning vehicles or host cells with plasmids containing the 
corresponding Helicobacter nucleic acid sequence, cloning vehicles or host cells are 
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cultured under conditions such that the polypeptides are expressed. The polypeptide is 
then isolated substantially free of contaminating host cell components by techniques 
well known to those skilled in the art. In a preferred embodiment for purifying the 
desired polypeptide one can first follow the standard techniques of lysing the host cells 
5 and then isolating the inclusion bodies by removing soluble proteins and other 

contaminants potentially. In a second step, it is preferred to solubilize the inclusion 
bodies in a zwitterionic detergent, which are well-known in the art. The detergent may 
be used with a denaturant; however, surprisingly, no denaturant is required. Finally, the 
solubilized inclusion body material, is purified using cationic exchange gel 
1 0 chromatography, followed by eluting with a salt solution to collect the purified 

polypeptide. This method generates a polypeptide which is at least about 75 or 80% 
pure, preferably at least about 90%. 

This invention also provides for a method of diagnosing an H. pylori infection 
1 5 comprising the step of determining the presence, in a sample, of an amino acid sequence 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7), or preferably any of the isolated K 
pylori polypeptides of this invention. Any conventional diagnostic method may be 
used. These diagnostic methods can easily be based on the presence of an amino acid 
sequence or polypeptide. Preferably, such a diagnostic method matches for a 
20 polypeptide having at least 10, and preferably at least 20, amino acids which are 
common to the polypeptides of this invention. 

As noted, the present invention also relates to nucleic acid sequences encoding 
H. pylori polypeptides. The nucleic acid sequences disclosed herein can also be used for 

25 a variety of diagnostic applications. These nucleic acids sequences can be used to 

prepare relatively short DNA and RNA sequences that have the ability to specifically 
hybridize to the nucleic acid sequences encoding the polypeptides of this invention. 
Nucleic acid probes are selected for the desired length in view of the selected 
parameters of specificity of the diagnostic assay. The probes can be used in diagnostic 

30 assays for detecting the presence of pathogenic organisms in a given sample. With 

current advanced technologies for recombinant expression, nucleic acid sequences can 
be inserted into an expression construct for the purpose of screening the corresponding 
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oligopeptides and polypeptides for reactivity with existing antibodies or for the ability to 
generate diagnostic or therapeutic reagents. 

In preferred embodiments, the nucleic acid sequences employed for 
5 hybridization studies or assays include sequences that are complementary to a 

nucleotide stretch of at least about 10 to about 20 nucleotides, although at least about 10 
to 30, or about 30 to 60 nucleotides can be used. Nucleotide stretches of at least 10 
nucleotides are beneficial for providing stability and selectivity when testing a clinical 
sample for Helicobacter infection. A variety of known hybridization techniques and 

1 0 systems can be employed for practice of the hybridization aspects of this invention, 
including diagnostic assays such as those described in Falkow et al. s US Patent 
4,358,535. Depending on the application, one will desire to employ varying conditions 
of hybridization to achieve varying degrees of selectivity of the probe toward a target 
sequence. For applications requiring a high degree of selectivity, one will select 

1 5 relatively low salt and/or high temperature conditions, such as provided by 0.02M- 

0.1 5M NaCl at temperature of about 50°C to 70°C. These conditions are particularly 
selective, and tolerate little, if any, mismatch between the probe and the template or 
target strand. 

20 For some applications, if one desires to prepare mutants employing a mutant 

primer strand hybridized to an underlying template, less stringent hybridization 
conditions are called for in order to allow formation of the heteroduplex. The conditions 
may be altered by using 0.15M-0.9M salt, at temperatures ranging from about 20°C to 
about 55°C. In general, it is appreciated that conditions can be rendered more stringent 

25 by the addition of increasing amounts of formamide, which serves to destabilize the 
hybrid duplex in the same manner as increased temperature. Thus, hybridization 
conditions can be readily manipulated, and the method of choice will generally depend 
on the desired results. 

30 In certain embodiments, one may desire to employ nucleic acid probes to isolate 

variants from clone banks containing mutated clones. In particular embodiments, 
mutant clone colonies growing on solid media which contain variants of an K pylori 
polypeptide sequence could be identified on duplicate filters using hybridization 
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conditions and methods, such as those used in colony blot assays, to obtain 
hybridization only between probes containing sequence variants and nucleic acid 
sequence variants contained in specific colonies. In this manner, small hybridization 
probes containing short variant sequences of the K pylori genes of the invention may be 
5 utilized to identify those clones growing on solid media which contain sequence variants 
of the entire genes encoding polypeptides of 75, 77 and 79 kDa as discussed herein. 
These clones can then be grown to obtain desired quantities of the variant nucleic acid 
sequences or the corresponding antigen. 

1 0 In clinical diagnostic embodiments, nucleic acid sequences of the present 

invention are used in combination with an appropriate means, such as a label, for 
determining hybridization. A wide variety of appropriate indicator means are known in 
the art, including radioactive, enzymatic or other Iigands, such as avidin/biotin, which 
are capable of giving a detectable signal. In preferred diagnostic embodiments, one will 

1 5 likely desire to employ an enzyme tag such as urease, alkaline phosphatase or 

peroxidase, instead of radioactive or other environmental undesirable reagents. In the 
case of enzyme tags, colorimetric indicator substrates are known which can be 
employed to provide a means visible to the human eye or spectrophotometrically, to 
identify specific hybridization with pathogen nucleic acid-containing samples. 

20 

In general, it is envisioned that the hybridization probes described herein will be 
useful both as reagents in solution hybridizations as well as in embodiments employing 
a solid phase. In embodiments involving a solid phase, the test DNA (or RNA) from 
suspected clinical samples, such as exudates, body fluids (e.g., amniotic fluid, middle 

25 ear effusion, bronchoalveolar lavage fluid) or even tissues, is absorbed or otherwise 

affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then 
subjected to specific hybridization with selected probes under desired conditions. The 
selected conditions will depend on the particular circumstances based on the particular 
criteria required (depending, for example, on the G+C contents, type of target nucleic 

30 acid, source of nucleic acid, size of hybridization probe, et.). Following washing of the 
hybridized surface so as to remove nonspecifically bound probe molecules, specific 
hybridization is detected, or even quantified, by means of the label. 
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The nucleic acid sequences which encode for the H. pylori polypeptides of the 
invention, or their variants, may be useful in conjunction with PCR™ technology as set 
out, e.g., in U.S. Patent 4,603,102, one may utilize various portions of any of K pylori 
sequences of this invention as oligonucleotide probes for the PCR™ amplification of a 
5 defined portion of an H. pylori (75, 77 kDa) sequence may then be detected by 

hybridization with a hybridization probe containing a complementary sequence. In this 
manner, extremely small concentrations of H. pylori nucleic acid may be detected in a 
sample utilizing the nucleic acid sequences of this invention. 

1 0 The following examples are included to demonstrate preferred embodiments of 

the invention. It should be appreciated by those skilled in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventors 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in the light of 

1 5 the present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

EXAMPLES 

20 

Example 1 

1.1 Bacterial strains. K pylori strains PBCC 1101, PBCC 1 102, PBCC 1 103, 
PBCC 1 105, and PBCC 1 107 were isolated from human gastric biopsies obtained from 
the University of Rochester School of Medicine and Dentistry (Rochester, NY). H. 

25 pylori strains LET 13 and RSD 14 were isolated from human gastric biopsies obtained 
from the Syracuse Veterans Administration Medical Center (Syracuse, NY). H. pylori 
strains MH 60 EG 52, RJ 17, LJ 63, and MJ 34 were obtained as frozen stocks from the 
Clement J. Zablocki VA Medical Center (Milwaukee, WI). K pylori strain SSI was 
originally isolated from a human gastric biopsy, and subsequently adapted to infect mice 

30 (obtained from A. Lee, University of New South Wales, Sydney, Australia). H. pylori 
strains obtained from American Type Culture Collection were ATCC 43504 and ATCC 
43579. The H. felts strain was obtained from T. Blanchard, Case Western Reserve 
University, Cleveland, OH. 
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1.2 Culturing of Helicobacter strains. Cultures of H. pylori and H. felis were 
grown at 37°C on Columbia broth agar plates with 10% defibrinated horse blood and 10 
(ig/ml vancomycin in a microaerophilic chamber. Liquid cultures of H. pylori strains 

5 were grown at 37°C in BHI medium with 4% fetal calf serum and 10 jag /ml 
vancomycin in flasks infused with a gas mixture of 10% CCV 6% 0 2 f 84% N2 
(vol/vol/vol). 

1.3 Preparation of H. pylori PBCC 1103, Strain PBCC 1 103 was grown as above. 
1 0 The cell pellet was washed twice in a phosphate buffered saline (PBS) solution, and 

resuspended in a 0.3% formaldehyde solution for 1 hr. The fixed cells were then 

washed in and resuspended in PBS to an O.D. 60 o of 0.1 (approximately 10 7 cfu). 

1.4 Injection of Mice with H. pylori PBCC 1103. Ten 6 to 8 wk old female 

1 5 BALB/c mice were primed by interperitoneal injection with ca. 10 formalin fixed 

PBCC 1 103 cells at weeks 0, and boosted at weeks 2, 4 and 8. After a 34 week rest 

7 

period a pre-fusion boost of approximately 10 formalin fixed PBCC 1 103 was given 
interperioneally at week 42. 

20 1.5 Production of Monoclonal Antibodies, Hybridoma Techniques and 

Screening Procedures. During immunization and the rest period, mouse sera were 
obtained (week 6, 10) and tested for antibody activity by ELISA by using air-dried, 
formalin fixed PBCC 1 103 cells, 0.1 ml per well at an absorbance of 0.100 (A^o) as tne 
coating antigen. 

25 

Spleens were recovered from five immunized mice about 72 hours after the last 
injection, and were combined with nonsecreting X63Ag8.653 mouse (BALB/c) 
myeloma cells in 5:1 ratio (splenocytes:myeloma). The cells were fused for four 
minutes in 50% (v/v) polyethylene glycol 1500 and 10% dimethylsulfoxide in 
30 Dulbecco's Modified Eagle medium (D-MEM). The fused cells were diluted in 

selection medium, D-MEM supplemented with hypoxanthine, aminopterin, thymidine, 
10% fetal bovine serum and 10% NCTC-109 media supplement (Gibco-BRL). The 
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fusion efficiency (wells with colony growth vs. number of wells seeded) was 100% 
(900/900). 

Primary screening was completed by ELISAs using PBCC 1 103 whole cell 
5 antigen as described above, partially purified protein mixture with urease and heat shock 
protein(s) and lipopolysaccharide (LPS) purified from PBCC 1 103 cells. Secondary 
selection was completed by SDS-PAGE Western blot using heterologous ATCC 43579 
whole cell lysate as antigen. Positive reactors were identified and designated 1 through 
90, coded as hybridoma (Hpy) and saved for further characterization. 

10 

Selected hybridomas of interest were subcloned once by limiting dilution 
procedure. Monoclonal antibodies were provided as tissue culture supernatant (TCS), 
concentrated by 50% saturated ammonium sulfate precipitation (SAS-TCS) or ascites. 

1 5 1.6 SDS-PAGE and Western Blot Analysis, 

SDS-PAGE was carried out as described by Laemmli using 10-18% (w/v) 
acrylamide gels (Zaxis, Hudson, OH). Proteins were visualized by staining the 
gels with Coomassie ProBlue (Owl Separation Systems, Woburn, MA). Gels 
were scanned using a Personal Densitometer SI (Molecular Dynamics Inc., 

20 Sunnyvale, CA) and molecular weights estimated using the Fragment Analysis 
software (version 1.1) and prestained molecular weight markers obtained from 
Owl Separation Systems. Transfer of proteins to polyvinylidene difluoride 
(PVDF) membranes was accomplished using a semi-dry electroblotter and 
electroblot buffers (Owl Separation Systems). Membranes were probed with the 

25 indicated antisera followed by goat anti-mouse or anti-rabbit alkaline phosphatase 
conjugate as the secondary antibody (BioSource International, Camarillo, CA). 
Western blots were developed using the BCIP/NBT Phosphatase Substrate . 
System from Kirkegaard and Perry Laboratories (Gaithersburg, MD). 

30 1.7 Conservation of the 75 and 77 kDa Proteins in H. pylori Strains. 

Tissue culture supernatants from hybridomas were analyzed by whole cell ELISA 
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and antigen ELISA, followed by Western analysis using whole cell lysates of a 
number of H. pylori strains. Parent hybridoma designated Hpy 64 was 
determined by these methods to react to proteins in the range of approximately 
75-79 kDa. Subsequent cloning produced the monoclonal antibody (MAb) 
5 designated Hpy 64-27, which has reactive epitopes in 10 of 12 H. pylori strains 
tested, but not in K felis (Table 1). 
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Table 1 

Reactivity of MAb Hpy64-27 with Heterologous K pylori 
Strains and an K felis strain (Conservation of 75/77 kDa Proteins) 



Strain 

ATCC 43579 
PBCC 1101 
PBCC 1102 
PBCC 1103* 
PBCC 1105 
MH60 
SSI 



Reactivity 

++ 
++ 
+/- 
++ 



Strain 

EG52 

RJ17 

LJ63 

MJ34 

PBCC 1107 
H. felis** 



Reactivity 



+/- 

+ 



* -homologous isolate 
++ : strongly reactive 
+/- : slightly reactive 



** non-pylori 
+ : reactive 

- : no detectable reactivity 



1 0 1.8 Purification of the 75 and 77 kDa Proteins. Bacterial cells (ca. 10 g wet 
wt ofK pylori ATCC 43579) were resuspended in 70 ml of 0.05 M HEPES / 10 
mM EDTA / 1.0 mM PMSF (pH 7.0) by homogenization using a Tekmar Ultra- 
Turrex tissue homogenizes The cells were disrupted by sonication using a 
Branson Sonifier Cell Disrupter. The disrupted cells, including the membrane 

1 5 fraction, were pelleted by centrifugation at 42,000 rpm using a Beckman 70Ti 
rotor for 40 min at 4°C. Following centrifugation, the pellet was resuspended in 
70 ml of 0.01 M HEPES / 1.0 mM MgCl 2 / 1.0 mM PMSF / 1.0% TX-100 (pH 
7.4) and stirred for 1 hr at room temp. The suspension was centrifuged at 42,000 
rpm using a Beckman 70Ti rotor for 40 min at 4°C. Following centrifugation, the 

20 pellet was resuspended in 70 ml of 0.05 M Tris-HCl / 1 0.0 mM EDTA / 1 .0% 
ZWITTERGENT™3-14 (pH 7.4) and stirred for 1 hr at room temp. The 
suspension was then centrifuged at 42,000 rpm using a Beckman 70Ti rotor for 
40 min at 4°C. Following centrifugation, the supernatant containing the 75 and 
77 kDa proteins was collected and stored at -20°C for further purification. 
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The ZWITTERGENT™ 3-14 crude extract was buffer exchanged by passage 
over a 250 ml Sephadex G-25 (coarse) column (Pharmacia) equilibrated in 0.02 M Tris- 
HC1 / 5.0 mM EDTA / 1 .0% ZWITTERGENT™3-14 (pH 8.0). Approximately one half 
(35 ml) of the buffer exchanged ZWITTERGENT™ 3-14 crude extract preparation was 
5 loaded onto a 10 ml Pharmacia SP Fast Flow Sepharose column equilibrated 0.02 M 
Tris-HCl / 5.0 mM EDTA / 1 .0% ZWITTERGENT™ 3-14 (pH 8.0). Unbound protein 
was washed through the column with an additional 2 bed volumes of equilibration 
buffer. The 75 and 77 kDa proteins were co-eluted using a linear NaCI gradient (0-15 M 
NaCl) in 0.02 M Tris-HCl / 5.0 mM EDTA / 1.0% ZWITTERGENT™ 3-14 (pH 8.0). 

1 0 Fractions were screened for the 75 and 77 kDa proteins by SDS-PAGE / Western and 
pooled. Twelve ml of pooled co-eluting 75 and 77 kDa was subsequently applied to a 
500 ml Pharmacia Superose-12 column equilibrated in phosphate buffered saline (PBS) 
/ 1.0% ZWITTERGENT™ 3-14. Fractions were screened for the co-eluting 75 and 77 
kDa proteins by SDS-PAGE / Western and pooled. Material from two independent SP 

1 5 Sepharose / Superose-12 runs was combined and the co-eluted protein was precipitated 
by the addition of 9 volumes of ethanol overnight at -20°C. The resulting suspension 
was then centrifuged at 9,000 rpm using a Beckman SS3 1 rotor for 30 min at 4°C and 
the pellet was resuspended in 10 ml of PBS / 1 .0% ZWITTERGENT™ 3-14. Purified 
75 and 77 kDa proteins were stored at -20°C. The protein isolated using this protocol 

20 exhibited two bands on SDS-PAGE corresponding to subunit molecular masses of 75 
kDa and 77 kDa (Figure 1 A). 

Protein Estimation. Protein concentration was estimated by the BCA assay 
from Pierce (Rockford, IL) using BSA as a standard. 

25 

1.9 Reactivity of MAb 64-27 clone with 75 kDa and 77 kDa. Western blot 
analysis of co-purified 75 kDa and 77 kDa proteins revealed that only the 75kDa 
protein reacts strongly with the MAb 64-27 (Figure IB). 

30 Example 2 

2.1 Production in Mice of Polyclonal Antisera to Purified 75 kDa and 77 kDa 
Proteins. Two groups of 10 Swiss Webster mice were used for production of 
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polyclonal antisera to the purified 75 kDa and 77 kDa proteins. One group was injected 
with 25 jag of the purified protein with incomplete Freunds adjuvant (IFA) on weeks 0 
and 4. The other group was injected with 25 \xg of the purified proteins with 50 p,g of 
MPL™ on weeks 0 and 4. Both groups were bled on week 6. 

5 

2.2 Surface Labeling by Immunoelectron Microscopy (IEM). Twenty microliter 
droplets of live PBCC 1 105 in broth were placed on parafilm. Three Hundred mesh 
gold carbon-coated formvar grids were placed film side down on the droplets. Grids 
with cells were then rinsed in droplets of PBS/BSA buffer and transferred to PBS 

1 0 containing 1% cold water fish gelatin. After blocking, grids were incubated in mouse 
polyclonal antisera diluted 1 :50 in PBS/BSA for 1 hr, then rinsed in buffer. Grids were 
then incubated on a fifty-fold dilution of Nanogold (Nanoprobes, Inc., Stony Brook, 
NY), washed in buffer, and fixed with a solution of 1% gluteraldehyde in PBS. The 
fixative was removed from the grids with deionized water. The HQ silver enhancement 

1 5 kit (Nanoprobes, Inc.) was used to nucleate the Nanogold. The grids were then stained 
with Nanovan (Nanoprobes, Inc.) and viewed on a Zeiss 10C transmission electron 
microscope operating at 100 Kv. The polyclonal antiserum obtained from injecting 
mice with live strain PBCC 1 103 was able to label the surface of heterologous strain 
PBCC 1 105. As seen in Figure 2, the 75/77 kDa proteins are surface localized and that 

20 the reactive epitopes are present in both K pylori strains. 

23 Flow Cytometry. Approximately 1 0 6 - 1 0 7 cells of PBCC 1 105 (O.D. 600 range 
of 0.2 to 1 .6) were washed once in PCM buffer (10 mM NaP0 4 , pH 7.4, 150 mM NaCl, 
0.5 mM MgCl 2 , 0,15 mM CaCl 2 ). The cell pellet was gently resuspended in 100 ml 
25 PCM buffer. One microliter of polyclonal antisera was added and the cells incubated at 

o 

37 C for 30 min on a rocker. Cells were diluted with 900 \xl PCM buffer, mixed gently 
and pelleted at maximum speed in a microcentrifuge for 1 min. The pellet was gently 
resuspended in 100 |il of PCM buffer. One microliter of Oregon Green 514 conjugated 
goat anti-mouse antibody (Molecular Probes) was used to fluorescently label the cells 
30 tagged with the polyclonal antisera at 37°C for 30 min. The cells were then washed as 
described above and diluted in 1 ml PCM for analysis on the FACSort (Becton- 
Dickenson). Sera obtained from mice prior to injection with the mixture of the 75 kDa 
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and 77 kDa proteins on day 0 showed only background labeling in all five strains tested, 
with a mean fluorescent intensity of less than 10 (see Figure 3 A). Polyclonal antisera 
obtained from the same mice on day 49 following two injections of the protein mixture 
displayed a positive fluorescent peak with all strains tested (strains ATCC 43579, PBCC 
5 1105, ATCC 43504 and SS-1 : See Figure 3B). The mean fluorescent intensity ranged 
from 36.44 for SSI to 322.68 from the homologous strain ATCC 43579. This 
demonstrates that there are conserved epitopes among these strains. Without being 
bound by theory, the difference in the fluorescent intensities could be a function of the 
number of antigen molecules expressed on the cell surface or the antigenic variability of 
1 0 the epitopes among the heterologous strains. 

Example 3 

3.1 Assay of Bactericidal (BC) Activity of Polyclonal Antisera. Target cells of 
H. pylori strain PBCC 1 105 for the BC assay were provided on the morning of the 

1 5 assay. The optical density of the liquid culture at 600 nm was approximately 0. 1 . Cells 
pelleted from 10 ml of this liquid culture were used to adsorb any nonspecific 
bactericidal activity in the complement source (human complement). Cells and serum 
were allowed to react for at least 1 hour on ice with occasional agitation. After 
incubation, the serum and cell mixture was centrifuged, and the serum was removed and 

20 placed on ice until needed. The BC assay was performed when the liquid culture 

attained an O.D. 60 o of approximately 0.3. These cells were diluted in PCM buffer to a 

concentration of 10 6 cfu per ml . The reaction consisted of 10 |Ltl strain PBCC 1 105 at a 
concentration of 10 6 cfu per ml, 10 \x\ adsorbed serum as a complement source, 5 |il of 
diluted polyclonal antisera (heat inactivated 60°C for 10 min), and 25 ^1 of PCM. The 

25 reaction was incubated at 36°C in a 10% C0 2 chamber for 30 min. Two hundred (il of 
PCM was then added to the reaction and duplicate 50 pal aliquots were plated on 
Columbia agar with 10% defibrinated horse blood and 10 (ig/ml vancomycin. Plates 
were incubated in a microaerophilic chamber for at least 72 hours, and counted on an 
automated plate reader. Polyclonal antisera from mice injected with the 75/77 kDa 

30 protein mixed with Incomplete Freunds Adjuvant (1FA) had a BC 50 at a dilution of 

greater than 1/3200 and 75/77 kDa protein mixed with MPL™ had a BC 50 at a dilution 
of greater than 1/3200 , as compared to 1/800 in serum from control mice (see Figure 4, 
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complete data for MPL is not shown). This is another indication that the epitopes of the 
75/77 kDa proteins are surface exposed. 

Example 4 - Immunization and Challenge Tests 
5 4.1 Immunizations. C57BI/6 mice were vaccinated intragastrically on days 0, 2, 
14, and 16 with 100 fig recombinant K pylori (Hp) urease (rUre) or native Hp 75/77 
kDa protein formulated with 10 jig cholera toxin (CT); or were vaccinated 
subcutaneously on days 0 and 16 with 10 fig recombinant Hp rUre or native Hp 75/77 
kDa protein formulated with 100 jag Aluminum Phosphate (A1P0 4 ). Control mice 
1 0 received no vaccine. 

4.2 Reduction of Colonization in the Mouse Model. Infection was established by 
the intragastric deposition of one 0.1 ml inoculum of H. pylori strain SSI on day 37. 

8 

The challenge inocula contained a suspension of H. pylori at a concentration of 2 x 10 

1 5 cfu per ml. The amount of viable K pylori in the stomachs of mice was determined 2, 4 
and 8 weeks following challenge (days 50 and 64). At each time point, 5 mice were 
sacrificed by cervical dislocation, their stomachs harvested and split into 2 longitudinal 
sections. One section from each mouse was homogenized and the homogenate diluted 
and plated on Columbia agar containing defibrinated horse serum and the appropriate 

20 antibiotics. Plates were incubated at 37 C in a microaerophilic incubator for 5 days. 

After incubation the number of viable H. pylori isolated were enumerated. Total cfu per 
gram of stomach tissue were obtained by multiplying colony numbers per plate by the 
dilution factor and dividing by the weight of the sectioned stomach; numbers were 
transformed and are expressed as log 10 cfu per gram homogenized stomach tissue + 1 

25 standard error of the mean. Mice which were vaccinated intragastrically with the 75/77 
kDa protein mixture admixed with CT showed a significant reduction in colonization 
following challenge with Hp strain SSI, as compared to non-vaccinated controls. At 
four weeks post-challenge, using either oral or parenteral routes of immunization, the 
number of colony forming units recovered per gram of stomach tissue was reduced by 

30 approximately 1-2 logs in vaccinated mice (see Figure 5). This result is comparable to, 
if not improved over, the reduction that is afforded by vaccination with recombinant 
urease admixed with CT. 
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Example 5 - Protein Analysis 

5.1 Enzymatic Cleavage of the 75 kDa and 77 kDa Proteins. 

(i) Trypsin cleavage. Approximately 0.5 mg of the 75 kDa and 77 kDa protein 
5 mixture was precipitated with 90% (v/v) ethanol and the pellet resuspended in a total 
volume of 1 ml of phosphate-buffered saline (PBS) / 0.1% ZWITTERGENT™ 3-14. 
Ten microliters of a 0.5 mg/ml solution of trypsin was added (Boehringer Mannheim, 
Indianapolis, IN), and the reaction mixture incubated for 4 hr at 37°C. 

1 0 (ii) Elastase cleavage. Approximately 0.5 mg of the 75 kDa and 77 kDa protein 

mixture was precipitated with 90% (v/v) ethanol and the pellet resuspended in a total 
volume of L0 ml of PBS / 0.1% ZWITTERGENT 3-14. Five microliters of a 1.0 mg/ml 
solution of elastase was added (Worth ington), and the reaction mixture incubated for 4 
hrat37°C. 

15 

(iii) Endoproteinase Lys-C cleavage. Approximately 0.5 mg of the 75 kDa 
and 77 kDa protein mixture was precipitated with 90% (v/v) ethanol and the pellet 
resuspended in a total volume of 1.0 ml of PBS / 0.1% ZWITTERGENT 3-14. This 
preparation was added directly to a vial containing 5 mg of endoproteinase Lys-C 

20 (Boehringer Mannheim). The reaction mixture was incubated for 4 hr at 37°C. 

(iv) Separation of peptides. The cleavage reaction mixtures from (i) -(iii) 
above were centrifuged in an Eppendorf centrifuge at 12,000 rpm for 5 min and the 
supernatant loaded directly onto a Vydac Protein C4 HPLC column (The Separations 

25 Group, Hesperia, CA). The solvent system used consisted of 0.1% (v/v) aqueous 

trifluoroacetic acid (TFA), (Solvent A) and acetonitrile : H 2 0 : TFA, 80:20:0.1 (v/v/v) 
(Solvent B) at a flow rate of 1.0 ml/min. Following initial wash with Solvent A, the 
peptides were eluted with a linear gradient from 0-100% Solvent B in 30 min and 
detected by absorbance at 220 nm. Suitable fractions were collected, dried down in a 

30 Speed-Vac concentrator (Jouan Inc., Winchester, VA) and subsequently resuspended in 
distilled water. The fractions were subjected to SDS-PAGE using 1 0- 1 8% (w/v, 
acrylamide) gradient gels (Owl Separation Systems) in a Tris-Tricine buffer system 
(Schogger and von Jagow). The fractions exhibiting a single peptide band were 
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submitted for N-terminal sequence analysis. Fractions displaying multiple peptide 
bands in SDS-PAGE were electrophoretically transferred onto a PVDF membrane as 
described above. The membrane was stained with Coomassie Brilliant Blue R-250 and 
individual bands excised and submitted for N-terminal sequence analysis (Matsudaira), 
5 as discussed in Example 6 below. 

5.2 Estimation of Molecular Weight By MALDI-TOF Mass Spectrometry. 

Determination of molecular weight by Matrix Assisted Laser Desorption / Ionization - 
Time of Flight (MALDI-TOF) mass spectrometry (Hillenkamp and Karas) was carried 

1 0 out using a Lasermat 2000 Mass Analyzer (Finnigan Mat, Hemel Hempstead, UK) with 
3,5-dimethoxy-4-hydroxy-cinnamic acid as the matrix. For samples containing 1.0% 
ZWITTERGENT™ 3-14, cold ethanol precipitation was carried out twice to remove the 
detergent using a 90% (v/v) final ethanol concentration followed by solubilization of the 
precipitated protein in water.. MALDI-TOF analysis using 3,5-dimethoxy-4-hydroxy- 

1 5 cinnamic acid matrix in presence of 70% (v/v) aqueous acetonitrile / 0. 1% TFA resulted 
in the identification of predominantly two species with average molecular weights of 
75,572 kDa and 77,633 kDa. 

Examples 6 - N-terminal Sequence Analysis 

20 6.1 Amino Acid Sequence Analysis. N-terminal sequence analysis was carried out 
using an Applied Biosystems Model 477 A Protein/Peptide Sequencer equipped with an 
on-line Model 120A PTH Analyzer (Applied Biosystems, Foster City, CA). The 
phenylthiohydantoin (PTH) derivatives were identified by reversed-phase HPLC using a 
Brownlee PTH C- 18 column (particle size 5 mm, 2.1 mm i.d. x 22 cm I.; Applied 

25 Biosystems). 

6.2 N-terminal Sequence Analysis of Both Intact 75 kDa and 77 kDa Proteins 
and Internal Peptides. Determination of the N-terminal sequence of both 75 kDa and 
77 kDa from intact proteins blotted onto PVDF exhibited identical sequence through the 
30 first 20 amino acids, EDDGFYTSVGYQIGEAAQMV (SEQ. ID No.: 7) of the mature 
processed form of these proteins, i.e. after cleavage of their signal peptides. This 
sequence was shown to match the N-terminus of three translated gene sequences 
identified from the K pylori genome (Note; the third gene corresponds to a 79 kDa K 
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pylori genome (Note; the third gene corresponds to a 79 kDa H. pylori protein as noted 
in Example 7). In order to determine which of the three genes encode the 75 kDa and 
77 kDa proteins, internal peptide fragments were generated from the mixture and 
subjected to N-terminal sequence analysis. Table 2 shows the N-terminal sequences 
5 obtained for both the intact proteins as well as fragments generated from the digestion of 
the mixture as described in Example 5. Sequence matches with the primary sequence 
deduced from the respective gene sequence are also indicated for each fragment. The 
data suggests that the mixture includes the 75 kDa and 77 kDa gene products because 
there is sequence identity specific to the respective genes. This result, in turn, is 
1 0 consistent with the mass results obtained by MALDI-TOF. 

Table 2 
Fragment Sequence Summary 
(positions are listed starting from the first amino acid of the 
leader sequence) 



Sequence 


75 kDa 


77 kDa 


79 kDa 


SEQ ID 
No. 


EDDGFYTSVGYQIGEAAQMV 


21-40 


21-40 


21-40 


34 


EDDGFYTSVGYQIGEAAQMVK 


21-41 






35 


STSSTTIFNNEPGYR 




135-149 




36 


TGGKPN-P — WS 


215-228 






37 


TTTQTIDGK 




226-234 




38 


NSIAHFGTQE-QI 


422-434 


447-459 


459-471 


39 


VPNAQSLQNVVSK 


468-481 


493-506 


505-517 


40 


SKKNNPYSPQGIET 


479-492 


504-517 


516-529 


41 


NYYLNQN 


493-499 


518-524 


530-536 


42 



Example 7 - Identification of the 75 kDa, 77 kDa and 79 kDa Genes in the 
Chromosome of H. pylori. 
30 7.1 Genetic Methods. Isolation of plasmid and chromosomal DNA, agarose gel 
electrophoresis and restriction enzyme digestion were performed following standard 

35 



Printed:p6-64|20Ga 



WO 00/00614 



PCT/US99/14375 



protocols (Sambrook et al.). DNA ligation was performed using a TOPO ligation kit 
(Invitrogen) following the recommended protocol. 

7.2 Oligonucleotide Synthesis. Single stranded PCR primers were synthesized on 
5 an Applied Biosystems model 380B DNA synthesizer using B-cyanoethyl 

phosphoramidite chemistry. 

7.3 DNA Sequencing. DNA sequencing was obtained by asymmetric PCR 
amplification using the fluorescent dye-labeled dideoxynucleotide terminator method 

1 0 (Gyllensten et al., 1988). The dye-labeled single stranded DNA fragments were 

separated and identified with an Applied Biosystem model 3 73 A automatic sequencing 
apparatus. Primary sequence information was analyzed using MacVector DNA analysis 
program (IBI, New Haven CT). 

1 5 7.4 PCR Amplification. PCR amplifications were performed in 500 jlxI tubes 

containing 100 |il reaction volume overlaid with mineral oil. Each reaction contained 
10 mM Tris-HCL, pH 8.3, 50 mM KC1 and L5 mM MgCl 2 ; 2.5 units Taq polymerase 
(Boehringer Mannheim), 200 mM dNTPs; 20 mM oligonucleotide primers and 1-2 |ig 
chromosomal DNA. Templates were amplified for 30 cycles with a 1 second extension 

20 on the annealing time of each cycle in a Perkin-EImer Cetus thermocycler. 

7.5 Search Programs. DNA alignment searches were performed using the 
MacVector program (IBI, New Haven CT). Protein homology searches were performed 
using either MacVector or DNASTAR (DNASTAR, Inc.) clustal alignment programs. 

25 

7.6 Identification of Genome Homology to a Degenerate Oligonucleotide. The 

K pylori genome (unannotated) was downloaded from TIGR (The Institute for Genomic 
Research) ftp file and stored as a MacVector file. The N-terminal amino acid sequence 
of the 77 kDa H. pylori protein was entered into MacVector program and reverse 
30 translated to give the degenerate DNA oligonucleotide which was aligned to the 

genome. The oligonucleotide used to align to the H. pylori genome had the following 
sequence: 
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5'GARGAYGAYGGNTTYTAYACNWSNGTNGGNTAYCARATHGGNGAR 
GCNGCNCARATGGTN 3' [SEQ. ID NO. 33], 
Three matches were observed, corresponding to nucleotide positions 1404865-1404807, 
722946-722888 and 352616-352558, all on the negative strand of the genome. 
5 MacVector was used to outline open reading frames which would overlap the above 

regions in the genomic DNA. Three open reading frames based on translational start and 
stop codons were found. An open reading frame between nucleotides 1402991-1404925 
(negative strand) predicted a protein of 81 ,157 kDa. There was a putative signal 
sequence of 20 amino acids which when cleaved would create a protein of 79 kDa 
1 0 starting with the amino acids EDDGFYTSVGYQIGEAAQMV (SEQ ID No.7). A 
second open reading frame between nucleotides 720808-723006 (negative strand) 
predicted a protein of 79,123 kDa. There was a putative signal sequence of 20 amino 
acids which when cleaved would create a protein of 77 kDa starting with the amino 
acids EDDGFYTSVGYQIGEAAQMV (SEQ ID No.7). A third open reading frame 
1 5 between nucleotides 350550-352672 (negative strand) predicted a protein of 77,687 
kDa. There was a putative signal sequence of 19 amino acids which when cleaved 
would create a protein of 75 kDa starting with the amino acids 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.7). 



The three proteins have identical 20 amino acids at the N-terminus and share 
internal peptide residues empirically determined from purified protein preparations 
(compare SEQ ID Nos. 1, 2, 3 and 7). 



Example 8 

25 8.1 PCR Amplification and Cloning of the 77 kDa Gene. PGR primers were 

synthesized which hybridized to the translational start and stop codons of the ORF. The 
PCR primers synthesized to amplify the 77 kDa gene from the chromosome ofK pylori 
had the following sequences: Forward Primer: 5' GGC CAT ATG AAA AAA CAC 
ATC CTT TCA TTA GCT TTA GGC TCG 3' (SEQ ID No.: 8) and Reverse Primer: 5' 

30 GGC A AG CTT GGG AGT TTC AC A AAA AGC TTA GTA AGC G AA CAC 3 ' (SEQ 
ID No.: 9). Chromosomal DNA from strains ATCC 43504. ATCC 43579, PBCC 1 103, 
PBCC 1 105, PBCC 1 107, LET 13, RSD 14, and SSI were used as template DNA. 
Conditions on the first round of PCR were 1 sec at 95°C, 1 sec at 60°C, 2 sec at 72°C , 
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with a 5 min hot start at 95°C (30 cycles). Ten microliters electrophoresed on an 
agarose gel revealed a 2 kb DNA fragment for strains ATCC 43504, PBCC 1 103, PBGC 
1 107, and LET 13. The PCR was repeated under lower annealing temperatures (55°C) 
to amplify strains ATCC 43579 and PBCC 1 105. Under these conditions, 
5 amplifications with these DNA templates were observed to have a diffuse band at 2 kb. 
A 2 kb family of bands was also observed after PCR amplification of SS 1 DNA at 50°C 
annealing temperature. 

Two microliters of each PCR amplification was added to a TOPO ligation mix 
1 0 (Invitrogen) and transformed into Topi OF' competent cells. Ampicillin resistance 

colonies showing a white color on X-gal IPTG plates were picked and grown for mini 
plasmid preparations (three for each strain). A single transformant for each strain which 
was confirmed to have an insert of 2 kb was picked for large scale DNA preparation and 
sequence analysis. 

15 

Sequence analysis confirmed that the genes for the 77 kDa protein from strains 
ATCC 43504, ATCC 43579, PBCC 1 103, PBCC 1 105 and the 79 kDa protein from 
strains PBCC 1 107, LET 13, and SSI have been cloned successfully as described above. 

20 Examples on Cloning, recombinant expression, 

and purification of the 75, 77, and 79 kDa proteins 

Strains and growth conditions. E. coli strain BLR(DE3) pLysS was obtained from 
Novogen. E. coli strains TOP 10 and Topi OF' were obtained from Invitrogen. E. coli 
25 strains were grown in HYSOY buffer containing 1% glucose or glycerol. Amplicilin 
(100 jig/mL) and chloramphenicol (30 ug/mL) were added when appropriate. 

DNA Manipulations. Ligations into expression vectors were performed as 
described by Sambrook et al. 

30 

Example 9 

9.1 Cloning and expression of the 75 kDa gene from H. pylori strain ATCC43579. 

38 



WO 00/00614 



PCT/US99/14375 



PCR amplification, cloning and sequencing of 70 kDa family genes from 7 strains of H, 
pylori revealed genes for the 77K kDa and the 79 kDa genes, but not the 75 kDa gene. 
Alignments of the three gene sequences published by TIGR revealed that the ATG start 
region of 75 kDa was different from that of the 77 kDa and 79 kDa genes and would not 
5 be expected to be amplified by the primer pair A and B (Table 3) used to amplify the 77 
and 79 kDa genes. A primer corresponding to the 75 kDa gene start region could not be 
designed because a stretch of 1 1 CT dinucleotide repeats exists 10 bp downstream from 
the ATG start codon. CT dinucleotide repeats are also found in other H. pylori genes, 
and therefore a primer containing the repeats would not be specific for the 75 kDa gene. 

1 0 DNA primers were designed based on the predicted sequences in and surrounding the 75 
kDa gene sequence as published by TIGR. A primer was designed to anneal upstream 
from the 75 kDa coding region (Table 3, primer C). Attempts to PCR amplify the 75 
kDa gene from strain ATCC 43579 using primers B and C under a variety of annealing 
conditions did not yield a PCR product. In order to obtain the 75 kDa gene from strain 

1 5 ATCC 43579, two complimentary internal primers were made based on empirically 
determined internal peptide sequence (Table 3, primers F and G). The front and back 
halves of the 75 kDa gene were amplifed in separate PCR reactions using primer pairs D 
and G or E and F (Table 3), cloned and sequenced to verify that they were homologous 
to the predicted 75 kDa sequence. 

20 

Two additional primers (Table 3, H and I) were designed with BsmBI ends for seamless 
cloning to the two 75 kDa gene fragments. Upon PCR amplification of the front and 
back halves with primer pairs D and I or E and H, the resulting DNA fragments were 
digested with BsmBI and ligated to each other. The ligated DNA was amplified with 

25 primers D and E, corresponding to the ends of the mature open reading frame. A 2 kb 
DNA fragment was detected by ethidium bromide stained agarose gel electrophoresis. 
This DNA band was cloned in plasmid pCR2.1 (Invitrogen, San Diego, CA ) resulting in 
pPX7768 and sequenced to confirm that it was the 75 kDa gene (Figure 6, SEQ ID NO. 
21). The predicted protein translation from this sequence is shown in Figure 7, SEQ ID 

30 NO 19. 

The mature gene fragment was PCR amplified using primer J and K (Table 3) with 
pPX7768 as template DNA. The fragment was cloned into pRSETb at the 
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NdelmECoRV sites to produce plasmid pPX781 1. This plasm id was transformed into 
the expression strain BLR(DE3)pLysS, selecting for ampicilin and chloroamphenicol 
resistance. The strain was grown to mig-logarithmic phase in HYSOY broth( Difco) 
containing ampicilin and chloroamphenicol and expression was induced by the addition 
5 of ImM isopropanol beta-D-thiogaiactopyranoside (IPTG). Expression proceeded for 2 
hours, after which the cells were harvested by centrifugation. Whole cell lysates were 
run on 10%SDS-PAGE. The 75kDa protein band was overexpressed as determined by 
coomassie stained gels and reacted with mAB Hpy 64-27 on Western blots 

1 0 9.2 Purification of the recombinant 75 kDa protein 

Bacterial cells (ca. 15 g wet wt of E. coli BLR(DE3)pLysS / pPX781 1) were resuspended in 
75 ml of 10 mM NaPC>4 / 150 mM NaCl / 5.0 mM EDTA / 1.0 mM Pefabloc (pH 7.2) by 
homogenization using a Ultra-Turrex tissue homogenizer (IKA Works, Wilmington, NC). 
The cells were disrupted by sonication using a Branson Sonifier Cell Disrupter. Inclusion 

1 5 bodies containing the recombinant 75 kDa protein were isolated by centrifugation at 10,000 
rpm using a Sorvall SLA- 1500 rotor for 30 min at 10°C. Following centrifugation, the 
pellet was resuspended by homogenization in 75 ml of 10 mM NaPC>4 / 150 mM NaCl / 5.0 
mM EDTA / 4.0% TX-100 (pH 7.2) and stirred for 1.5 hr at 4°C. The suspension was 
centrifuged at 10,000 rpm using a Sorvall SS-34 rotor for 30 min at 10°C. Following 

20 centrifugation, the pellet was resuspended by homogenization in 75 ml of 10 mM NaPC>4 / 
150 mM NaCl / 5.0 mM EDTA / 1 .0% Zwittergent 3-14 (Z 3-14) (pH 7.2) and stirred for 1.5 
hr at room temp. The suspension was centrifuged at 10,000 rpm using a Sorvall SS-34 rotor 
for 30 min at 10°C. Following centrifugation, the pellet was resuspended by 
homogenization in 75 ml of 10 mMNaP04 / 150 mM NaCl / 5.0 mM EDTA / 1.0% 

25 Zwittergent 3-16 (Z 3- 1 6) (pH 7.2) and stirred for 2 hr at room temp. The suspension was 
centrifuged at 10,000 rpm using a Sorvall SS-34 rotor for 30 min at 10°C. Following 
centrifugation, the Z 3-14 and Z 3-16 supernatants containing the recombinant 75 kDa 
protein were collected and stored at -20°C for further purification. The Z 3-14 extract was 
buffer exchanged by passage over a 1 80 ml Sephadex G-25 (coarse) column (Pharmacia 

30 Biotech Inc., Piscataway, NJ) equilibrated in 0.02 M Tris-HCI / 5.0 mM EDTA / 1 .0% Z 3- 
14 (pH 8.0). The buffer exchanged Z 3-14 crude extract preparation was loaded onto a 10 
ml SP-Sepharose Fast Flow column (Pharmacia Biotech Inc.) equilibrated with 0.02 M Tris- 
HCI / 5.0 mM EDTA / 1 .0% Z 3-14 (pH 8.0). Unbound protein was washed through the 
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column with an additional 2 column volumes of equilibration buffer. The recombinant 75 
kDa protein was eluted using a linear NaCI gradient (0-0.5 M NaCl) in 0.02 M Tris-HCl / 
5.0 mM EDTA / 1.0% Z 3-14 (pH 7.0) over 8 column volumes. Fractions were screened for 
recombinant 75 kDa protein by SDS-PAGE and Western analysis and pooled. Pooled 
5 fractions were diaiyzed into PBS / 1 .0% Z 3- 1 4 overnight at 4°C. The Z 3- 1 6 extract was 
processed as described for the Z 3-14 extract. 

Example 10 

10.1 Cloning and expression of the 77 kDa gene from H. pylori strain ATCC 43579 

1 0 Based on the published DNA sequence from TIGR, DNA oligonucelotide primers 

corresponding to the gene start (5') and end (3') were designed (Table 3, primers A and 
B). Chromosomal DNA from K pylori strains ATCC 43579, ATCC 43504, PBCC 1 103 
and PBCC 1 105 was isolated and used as templates for PCR amplification. Following 
PCR amplification, a 2 kb band was cloned into PCR2.1 TA cloning vector, and the K 

1 5 pylori insert was sequenced. DNA sequences of the 77 kDa genes from strains ATCC 

43504, PBCC 1 103 and PBCC 1 105 were only partial but showed more homology to the 
77 kDa gene from TIGR than to the 75 kDa or 79 kDa genes. The 77 kDa gene from 
strain ATCC 43579 was completely sequenced (plasmid pPX7760b). The DNA 
sequence of the complete coding region is shown in SEQ ID NO. 5. The predicted 

20 protein translation from this sequence is shown in SEQ ID NO. 2. 

The 77 kDa gene was cloned into expression vector pRSETb which directs expression 
of the foreign protein by the T7 promoter. The mature gene fragment was PCR 
amplified using primers J and K (Table 3) with pPX7760b as template DNA. The band 

25 was cloned into PCR2.1 (pPX7792). pPX7792 was cleaved with Ndel and EcoRV, and 
the 2 kb gene fragment was ligated into pRSETb cut with the same enzymes. After 
verification of the correct clone by restriction endonuclease digestion, the resulting 
plasmid, designated pPX7796, was transformed into the expression strain 
BLR(DE3)pLysS, selecting for ampicillin and chloramphenicol resistance. The strain 

30 was grown to mid-logarithmic phase in HYSOY broth (Difco, Detroit, Mich.) 

containing ampicillin and chloramphenicol, and expression was induced by the addition 
of 1 mM IPTG. Expression proceeded for 2 hours, after which the cells were harvested 
by centrifugation. Whole cells lysates were run on 10-12% SDS-PAGE. The 77 kDa 
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protein band was overexpressed as determined by coomassie stained gels, and reacted 
with polyclonal serum raised against the native protein. 

10.2 Purification of the recombinant 77 kDa protein. 

5 Bacterial cells (ca. 32 g wet wt of E. coli BLR(DE3)pLysS (pPX7796) were resuspended in 
150 ml of 10 mM NaPC>4 / 150 mM NaCl / 5.0 mM EDTA /l.OmM Pefabloc (pH 7.2) by 
homogenization using a Ultra-Turrex tissue homogenizes The cells were disrupted using a 
Microfluidizer Model 1 10Y. Inclusion bodies containing the recombinant 77 kDa protein 
were isolated by centrifugation at 10,000 rpm using a Sorvall SLA- 1 500 rotor for 30 min at 

1 0 10°C. Following centrifugation, the pellet was resuspended by homogenization in 1 50 ml of 
lOmM NaPC>4 / 150mM NaCl / 5.0 mM EDTA / 1.0% TX-100 (pH 7.2) and stirred for 2 hr 
at 4°C. The suspension was centrifuged at 10,000 rpm using a Sorvall SS-34 rotor for 30 
min at 10°C. Following centrifugation, the pellet was resuspended by homogenization in 75 
ml of lOmM NaP04 / 150mM NaCl / 5.0 mM EDTA / 1 .0% Z 3-1 6 (pH 7.2) and stirred 

1 5 overnight at room temp. The suspension was centrifuged at 10,000 rpm using a Sorvall SS- 
34 rotor for 30 min at 10°C. Following centrifugation, the supernatant containing the 
recombinant 77 kDa protein was collected and stored at -20°C for further purification. The 
Z 3-16 extract was buffer exchanged by passage over a 180 ml Sephadex G-25 (coarse) 
column equilibrated in 0.02 M Tris-HCl / 5.0 mM EDTA / 1 .0% Z 3-14 (pH 8.0). The 

20 buffer exchanged Z 3-16 crude extract preparation was loaded onto a 10 ml SP-Sepharose 
Fast Flow column equilibrated 0.02 M Tris-HCl / 5.0 mM EDTA / 1.0% Z 3-14 (pH 8.0). 
Unbound protein was washed through the column with an additional 2 column volumes of 
equilibration buffer. The recombinant 77 kDa protein was eluted using a linear NaCl 
gradient (0-0.5 M NaCl) in 0.02 M Tris-HCl / 5.0 mM EDTA / 1.0% Z 3-14 (pH 7.0) over 8 

25 column volumes. Fractions were screened for recombinant 77 kDa protein by SDS-PAGE 
and Western analysis and pooled. Pooled fractions were dialyzed into PBS / 1.0% Z 3-14 
overnight at 4°C. 

Example 11 

30 11.1 Cloning and expression of the 79 kDa gene from H. pylori strain PBCC 1107 

The 79 kDa gene was cloned form 3 strains of//, pylori , PBCC 1 107, Let 13, and SSI . 
Primers J and K (Table 3) were used to amplify the mature coding region for cloning 
into pRSETb following the same protocol as for the 75 and 77 kDa constructs. 
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Following transformation into BLR(DE3) pLysS and induction by IPTG, the 
recombinant plasmid pPX5048 directed the overexpression of the 79 kDa protein. The 
recombinant protein was easily detected in coomassie stained gels and reacted with both 
polyclonal sera raised against native 75, and 77 kDa genes (due to its homology) and 
5 with peptide-conjugate antisera directed against a sequence unique to the 79 kDa protein 
(described below). 

11.2 Purification of the recombinant 79 kDa protein. 

Bacterial cells (ca. 15 g wet wt of E. coli BL21(DE3)pLysS / pPX5048) are resuspended in 

10 75 ml of 10 mM NaPC>4 / 150 mM NaCl / 5.0 mM EDTA /l.OmM Pefabloc (pH 7.2) by 
homogenization using a Ultra-Turrex tissue homogenizen The cells are disrupted by 
sonication using a Branson Sonifier Cell Disrupter. Inclusion bodies containing the 
recombinant 79 kDa protein are isolated by centrifugation at 10,000 rpm using a Sorvall SS- 
31 rotor for 30 min at 10°C. Following centrifugation, the pellet is resuspended by 

1 5 homogenization in 75 ml of lOmM NaP0 4 / 150mM NaCl / 5.0 mM EDTA / 4.0% TX-100 
(pH 7.2) and stirred for 1.5 hr at 4°C. The suspension is centrifuged at 10,000 rpm using a 
Sorvall SS-31 rotor for 30 min at 10°C. Following centrifugation, the pellet is resuspended 
by homogenization in 75 ml of lOmM NaP0 4 / 150mM NaCl / 5.0 mM EDTA / 1.0% Z 3- 
14 (pH 7.2) and stirred for 1 .5 hr at room temp. The suspension is centrifuged at 10,000 

20 rpm using a Sorvall SS-34 rotor for 30 min at 10°C. Following centrifugation, the pellet is 
resuspended by homogenization in 75 ml of 10 mM NaPC>4 / 150 mM NaCl / 5.0 mM 
EDTA / 1 .0% Z 3-16 (pH 7.2) and stirred for 2 hr at room temp. The suspension is 
centrifuged at 10,000 rpm using a Sorvall SS-3 1 rotor for 30 min at 10°C. Following 
centrifugation, the Z 3-14 and Z 3-16 supernatants containing the recombinant 79 kDa 

25 protein are collected and stored at -20°C for further purification. The Z 3-14 extract is 

buffer exchanged by passage over a 1 80 ml Sephadex G-25 (coarse) column equilibrated in 
0.02 M Tris-HCl / 5.0 mM EDTA / 1.0% Z 3-14 (pH 8.0). The buffer exchanged Z 3-14 
crude extract preparation is loaded onto a 10 ml SP-Sepharose Fast Flow column 
equilibrated 0.02 M Tris-HCl / 5.0 mM EDTA / 1.0% Z 3-14 (pH 8.0). Unbound protein is 

30 washed through the column with an additional 2 column volumes of equilibration buffer. 
The recombinant 79 kDa protein is eluted using a linear NaCl gradient (0-0.5 M NaCl) in 
0.02 M Tris-HCl / 5.0 mM EDTA / 1 .0% Z 3-14 (pH 7.0) over 8 column volumes. Fractions 
are screened for recombinant 79 kDa protein by SDS-PAGE and Western analysis and 
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pooled. Pooled fractions are dialyzed into PBS / 1 .0% Z 3- 14 overnight at 4°C. The Z 3-16 
extract is processed as described for the Z 3-14 extract. 

Example 12 

5 DNA Sequence and Predicted Translation of 75, 77 and 79 kDa genes 

The DNA sequence of the 75 kDa gene from strain ATCC 43579 corresponding to the 
mature protein is shown in Figure 6, SEQ ID NO. 21 . The open reading frame is 2070 
bp long, the same as the TIGR predicted protein. The translated protein is 689 aas long 
containing 6 cysteine residues with an estimated pi of 8.88 (Figure 7, SEQ ID NO. 19). 
1 0 Alignment of the 75 kDa proteins from TIGR strain 26695 and ATCC 43579 reveals 
that 26 amino acid residues are not conserved. 

The DNA sequence of the 77 kDa gene from strain ATCC 43579 is shown in SEQ ID 
NO. 5. The open reading frame corresponding to the mature protein is 21 66 bp long, the 
1 5 same as the TIGR predicted protein. The translated protein is 721 aas long containing 6 
cysteine residues with an estimated pi of 8.89 (SEQ ID NO. 2). Alignment of the 77 
kDa proteins from TIGR strain 26695 and ATCC 43579 reveals substitutions in 67 aas. 
An additional 3 aas are unique to the TIGR protein, and an additional 1 1 aas are unique 
to the ATCC 43579 sequence. 

20 

The DNA sequence of the 79 kDa gene from strain PBCC 1 107 is shown in Figure 8, 
SEQ ID NO. 22. The open reading frame corresponding to the mature protein is 2157 
bp long, 21 bp shorter than the TIGR predicted protein. The translated protein from 
strain PBCC 1 107 is 718 aas long containing 8 cysteine residues with an estimated pi of 
25 6.99 which is considerably less than the pi of 8.68 predicted by the TIGR sequence 
(Figure 9, SEQ ID NO. 20). Alignment of the 79 kDa proteins from the TIGR strain 
26695 and PBCC 1 107 revealed 99 amino acid residue changes. There are 6 aas unique 
to the TIGR 79 kDa sequence, and 1 aa unique to the PBCC 1 107 79 kDa sequence. 

30 Example 13 

Comparison of expression constructs for the overproduction of the 77, 75 and 79kd 
proteins of //. pylori 
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13.1 - 77K. The full length gene was cloned into pETl 7b using the Ndel site to fuse the 
5' ends of the 77K gene to the vector ATG start codon, resulting in the full unaltered 
gene fused to the strong T7 promoter (pPX7762, Table 4). The full length gene was also 
cloned into an arabinose expression vector which also has strong promoter and which is 
5 tightly regulated. In addition, a third construct cloned the mature 77K gene to an ompT 
leader in pET12b which could enhance the translation and signal processing of the 
mature 77K gene (pPX7782). None of the constructs containing the gene with a signal 
sequence were overexpressed in excess of 1% total cellular protein (Table 4). 

1 0 Primers was made corresponding to the start of the mature protein and the end 

of the gene (Table 3, primers D and E). DMA sequence corresponding to the mature 
protein was PCR amplified using the pPX7760b as template DNA and cloned into 
PCR2.1 (pPX7769). The gene was cleaved from pPX7769 by digestion with Asp718 
and Spel and inserted into the T7 promoted pETl 7b (Novagen, Madison, Wisc.)at the 

1 5 Asp71 8 and Spel sites. The resulting plasmid fused 1 8 amino acid residues from the 
vector to the mature 77 kDa gene. Moderate expression of the 77 kDa protein was 
observed, but the induced protein was not visible by coomassie stained gels, indicating 
that the recombinant protein constituted less than 1 % of the total cellular protein. The 
mature gene fragment was also cloned into the arabinose expression vector pBAD24, 

20 which provided the ATG methionine and two additional vector-encoded amino acids to 
the mature gene. The recombinant plasmids were housed in the appropriate E. coli 
strains, BLR(DE3) pLysS for T7 promoted vectors and ToplOF(ara-) for arabinose 
vectors. Overnight cultures were seeded into HYSOY media containing 0.5-1 .0% 
glycerol and the appropriate antibiotic for selection. Cultures in mid-logarithmic phase 

25 were induced either with 1 mM IPTG (T7 vectors) or 0.0002-0.2% L-arabinose 

(arabinose vectors) and induction was allowed to proceed for 2 hours. Cells were 
harvested and resuspended in IX SDS-PAGE loading buffer and run on a 12% SDS- 
PAGE gel. The pBAD24 construct produced the recombinant 77 kDa protein which was 
detected by Western blot but not by coomassie stained gels. 

30 

Figure 11 shows that the 77K protein expressed from pE717b was not above 1% 
of the total cellular protein. The pRSETb based clone, pPX7796, was able to 
overexpress the protein greater than 10-20% of total cellular protein. Analysis of the 
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differences between pET17b and pRSETb reveals that both vectors have identical 
promoter sequences. A significant difference between the two vectors is in the copy 
number. pRSET b is based on the pUC replicon which maintains 100-700 copies per 
cell whereas pET vectors are based on a pMBl/ColEl replicon maintaining 25-30 
5 copies per celL A strong promoter coupled with a relatively high gene dosage resulted 
in overproduction of the recombinant mature 77K protein. 

Example -75K . The mature gene (pPX7768) was amplified with primers D and E (Table 
3) to obtain a fragment with appropriate restriction site ends for cloning into the 

1 0 arabinose expression vector pBAD24 (Guzman et al. 1995. J. Bacteriol. 177:4121- 
4130) , fusing the vector ATG start codon and two additional vector encoded amino 
acids to the mature gene (pPX7794). The 75 kDa gene was expressed by this vector 
following induction with arabinose, but the level was approximately 1% of cellular 
protein and the recombinant protein was prone to proteolytic cleavage. The mature 

1 5 coding region was cloned into the T7 polymerase expression system pRSETb 

(Invitrogen, San Diego Ca) by amplifying the 75 kDa gene from pPX7768 using primers 
J and K (Table 3) and Hgating the fragment into the Ndel and EcoRV sites of pRSETb. 
The resulting recombinant piasmid (pPX781 1) fuses the vector ATG to the mature 75 
kDa coding sequences. When the recombinant piasmid was induced by IPTG in the 

20 BLR(DE3) pLysS expression strain (Novagen, Madison, Wise), the recombinant 
protein was approximately 1 0% of the cell mass and was resistant to proteolysis. 
Comparison of recombinant protein yields after induction reveals that pPX781 1 was 
clearly superior in overexpressing the 75K mature protein (Figure 11). 

25 Example -79K The mature 79K gene from strain PBCC 1 1 07 was cloned into the 

arabinose vector pBAD24 fusing the vector ATG and 3 other amino acids to the start of 
the 75K gene (pPX5043, Table 4) and to the pRSET vector, fusing the vector ATG to 
the start of the mature 75K gene (pPX5048, Table). Comparison of recombinant protein 
yields after induction reveals that pPX5048 was superior in overexpressing the 79K 

30 mature protein (Figure 12). 

Notes on Recombinant protein induction and purification for the production of 75, 
77, 79K proteins 



46 



WO 00/00614 



PCT/US99/14375 



For the production of high yield of recombinant protein, the ampicilin marker used in 
plasmids pPX7796, pPX781 1 and pPX5043 is partially deleted and a gene for 
Kanamycin resistance is cloned in the site. This allows for fermentation of the proteins 
for use in humans. Strain E. coli BLR(DE3)pLysS housing either pPX7796, pPX781 1 or 
5 pPX5043 is grown in HYSOY media containing glycerol until logarithmic phase and 

induced with 0.5-1 mM IPTG. Induction proceeds for 2-4 hours after which the bacteria 
is harvested by centrifugation. 

Table 3A 



Primer 


Description 


SEQ. ID NO. 


A 


5' start of 77 and 79 kDa genes (sense) 


8 


B 


3' end of 75, 77, and 79 kDa genes (antisense) 


9 


C 


Promoter region of 75 kDa gene (sense) 


10 


D 


5' start mature 75, 77, and 7kDa genes (sense) 


11 


E 


3' end of 75, 77, and 7kDa genes (antisense) 


12 


F 


Internal 75 kDa specific (sense) 


13 


G 


Internal 75 kDa specific (antisense) 


14 


H 


Internal 75 kDa seamless cloning (sense) 


15 


1 


Internal 7kDa seamless cloning (antisense) 


16 


J 


5' end start mature 75, 77, and 7kDa genes (sense) 


17 


K 


3' end of 75, 77, and 7kDa genes (antisense) 


18 



Table 3B 



Primer 


Sequence (5* to 3') 


SEQ. ID NO. 


A 


GGCCATATGAAAAAACACATCCTTTCATTAGCT 


8 


B 


GGCAAGCTTGGGAGTTTCACAAAAAGCTTAGTAAGCGAACAC 


9 


C 


CGAAATCTTGTGATAAGATC 


10 


D 


CCGGGCTTGGTACCGGAAGACGACGGCTTTTAC 


11 


E 


CCGGGCACTAGTTTAGTAAGCGAACACATAATTCAAATACACGC 


12 


F 


CGGGGGGCAAACCAAATACAC 


13 


G 


GTGTATTTGGTTTGCCCCCCG 


14 


H 


GCCGAACGTCTCCCCTAAATGAGGCATGCCCAAACT 


15 


I 


CTACGGCGTCTCTTAGGGTAGTGATAATGATGCTCGCTTG 


16 


J 


CGCGGCGATATGGAAGACGACGGCTTTTAC 


17 


K 


CCGGCCGGTACCTTAGTAAGCGAACACATA 


18 
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Example 14 

Generation of peptide specific polyclonal antibodies to 
the 77 and 79 kDa proteins from H. pylori 

5 

14.1 Summary. 

Ten peptides representing unique sequences of the 77 and 79 kDa proteins (five to each) 
from K pylori were conjugated to CRM 197. CRM 1 97 is a well known carrier protein 
as described in Uchida, T. et aL, 1971 Nature New Biology, Vol. 233, 8-11. All of the 
1 0 conjugates were characterized by SDS-PAGE and amino acid analysis. The soluble 

conjugates were characterized by MALDI-TOF mass spectrometry analysis. An animal 
study was performed using the peptide conjugates as the antigen in order to produce 
antisera that would specifically recognize either the 77 or 79 kDa protein of H. pylori by 
western blot for purification protocols. 

15 

14.2 Peptides Used For Conjugation. 

All peptides were synthesized using the Gilson Multiple Peptide Synthesizer with aN- 
terminal cysteine residue necessary for covalent attachment to the carrier CRM 197 
protein in the specific conjugation chemistry used. The amino acid sequences of the 
20 peptides representing the 77 and 79 kDa peptides can be found in Table 5. All peptides 
were purified to near homogeneity by reversed phased HPLC and characterized by 
MALDI-TOF mass spectrometry and amino acid composition analysis. 



14.3 Preparation of Peptide-CRMi97 Conjugates. 

25 The 77 and 79 kDa peptides were conjugated to CRM197 using the crosslinking reagent 
N-succinimidyl bromoacetate (Bernatowicz et al.). On the day of the conjugation, the 
peptides were reacted with 5,5'-dithio bis(2-nitrobenzoic acid) [Ellman's reagent] to 
verify the content of free SH groups (resulted in greater than 95% free SH groups). Free 
amino groups of CRM 197 were bromoacteylated by reaction with an excess of 

30 bromoacetic acid N-hydroxysuccinimide ester (Sigma Chemical Co.). To an ice cold 

solution of CRM197, 10% (v/v) 1 .0 M NaHC03 (pH 8.4) was added. Bromoacetic acid 
N-hydroxysuccinimide ester, equal in weight to that of CRM 197 used, was dissolved in 
200 u.1 dimethylformamide, added slowly to the CRM] 97 and gently mixed at room 
temperature in the dark for 1 hour. The resulting bromoacetylated (activated) protein 
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was purified by passage through a P6-DG column using PBS, 1 mM EDTA (pH 7.0) as 
the eluent. Following purification, the fractions corresponding to activated CRM197 
were pooled and the protein concentration was estimated by BCA protein assay (Pierce 
Chemical Co.). To determine the extent of bromoacetylation, the protein amino groups, 
5 both before and after treatment with bromoacetic acid N-hydroxysuccinimide ester, 
were reacted with 2,4,6-trinitrobenzenesulfonic acid (TNBS). Approximately 70-75% 
of the amino groups were found to be bromoacetylated (activated). Following TNBS 
assay, approximately 2.4 - 4.6 mg of peptide was dissolved in sterile distilled water to 
an approximate concentration of 20 mg/ml. The peptide was slowly added to cold 
1 0 activated CRM j 97 in a 1:1 ratio (w/w) and the pH was adjusted to approximately 7.0- 
7.2. The resulting material was gently mixed overnight at 4°C in the dark followed by 
dialysis in the dark against two 1 L changes of PBS, pH 7.2. Each conjugate was then 
transferred into a sterile 15 ml polypropylene tube, wrapped in aluminum foil and stored 
at 4°C 

15 

14.4 Characterization of Peptide-CRMj97 Conjugates. 

To verify conjugation, all peptide-CRM]97 conjugates were analyzed by amino acid 
analysis (for the presence of the characteristic S-carboxymethylcysteine residues) and 
SDS-PAGE. The soluble conjugates and activated CRM 197 were subjected to MALDI- 
20 TOF mass spectrometry analysis. The protein concentration was determined by the 
average of values determined by amino acid analysis and BCA (Table 6). 

14.4(a) Amino Acid Analysis of Peptide-CRMj97 Conjugates. 

A suitable aliquot of each conjugate was hydrolyzed using 6N HC1 in the presence of 
25 5% phenol (v/v) and 1% 2-mercaptoethanol (v/v) under vacuum for 22 hours at 1 10°C. 
Following evaporation and resolubilization in Beckman sodium citrate sample dilution 
buffer (pH 2.2), the samples were analyzed on a Beckman 6300 Amino Acid Analyzer 
according to manufacturer's instructions. The degree of conjugation was determined by 
the estimated amount of S-carboxymethylcysteine residues formed per mole of 
30 CRM] 97. The degree of conjugation for each conjugate is described in Table 7. 

14.4(b) SDS-PAGE Analysis of Peptide-CRMi9 7 Conjugates 
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A suitable aliquot of each conjugates was mixed with reducing sample buffer and heated 
at 100°C for 5 minutes. The conjugates and Mark 12 molecular weight (MW) standards 
(Novex) were loaded on a 10 % (w/v, acrylamide) gel and SDS-PAGE (Laemmli) was 
carried out. Following SDS-PAGE, the gel was stained with Coomassie brilliant blue 
5 and destained. The samples exhibited a smear staining pattern across a wide MW range 
that is characteristic of similar peptide conjugates. 

14.4(c) MALDI-TOF Mass Spectrometry Analysis of Peptide-CRMi97 
Conjugates. 

10 A suitable aliquot of activated CRM 197 and each soluble conjugate were analyzed by 
MALDI-TOF mass spectrometry using 3,5-dimethoxy-4-hydroxy-cinnamic acid 
(sinapinic acid) as the matrix. Due to the addition of bromoacetyl groups (120 mass 
units each) to 29 lys residues of CRM 197 (mol. wt 58,408.7), which was based on 
approximately 75% activation from the TNBS assay, activated CRM 197 was expected 

1 5 to have a mol. wt. of [58,408.7 + (29x120)] - 61,889. The experimental mol. wt. of 
activated CRM 197 determined by MALDI-TOF mass spectrometry was 61,208 for 
activated CRM197 used for the 77k peptide conjugates and 61,009 for activated 
CRM 197 used for the 79k peptide conjugates. The degree of conjugation for each 
conjugate was calculated by subtracting the mass value of activated CRM 197 from the 

20 mass value of each conjugate and dividing by the mass of the peptide used to prepare the 
conjugate. The degree of conjugation for the soluble 77k and 79k peptide conjugates is 
described in the table 8. The mass values for heavily precipitated conjugates could not 
be determined. 

25 14.5 Animal Study. 

An animal study using the peptide conjugates was performed to produce antisera that 
will differentially recognize the 77kDa protein from the 79kDa protein in H. pylori blot. 
Table 9 describes the protocol for the animal study. 

30 14.6 Analysis of Immune Sera by Peptide ELISA. 

AH peptide ELISAs were run using Nunc Maxisorp 96-welI plates. Weeks 0 and 6 
peptide conjugate antisera were all titered against homologous peptides. Plates were 
coated with 100 uJ of peptides diluted to 1.0 u,g/ml with 50 mM sodium bicarbonate (pH 
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9.0) and incubated at 37°C overnight. Plates were washed and blocked for 1 hour at 
370C with 250 u,l of 3% BSA in 137 mM tris buffered saline (TBS), pH 7.6. Peptide 
conjugate antisera were serially diluted into 0.3% BSA in TBS containing 0.05% 
Tween-20, 100 u.1 of each dilution was added to the plates and incubated at 37°C for 1 
5 hour. The plates were washed and subsequently incubated for 1 hour at 37°C with 100 
pi of alkaline phosphatase conjugated goat anti mouse secondary antibody (Zymed) 
diluted 1:1500 in 0.3% BSA in TBS containing 0.05% Tween-20. The plates were 
washed and finally incubated for 1 hour at room temperature with 100 u.1 p-nitrophenyl 
phosphate substrate prepared in diethanolamine containing 0.5 mM MgCl2- Plates were 
1 0 read using an automated 96-well plate reader with a 405 nm test and 690 nm reference 
filter. All endpoint titers were calculated at 60 minutes at 0.1 AU. Peptide ELISA 
endpoint titers are shown in table 6. 

14.7 Analysis of Immune Sera by Western blot 

1 5 Week 6 sera from mice immunized with peptide groups: 97-7; #1, #6, #7 and 97-14; 
#14, #17 was used, (as stated in the Western Blot protocol section ) as the primary 
antibody in a Western blot against whole cell lysates of Helicobacter pylori and the 
recombinant 75, 77, and 79 kDa proteins. Conjugate 97-14 #14 generated a strong 
specific signal to the 79 kDa protein in both the H.pylori sample and the recombinant 79 

20 kDa protein, but not to native or recombinant 75 and 77 kDa proteins. 

Table 5. Sequences of peptides used in the 77 and 79 kDa conjugates. 
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Conjugate 


Peptide Sequence 


97-6 #3/CRM !97 (77k) 


CASGNTSHVITNKLDGVPDS (SEQ 23) 


97-6 #7/CRM, 97 (77k) 


CSPSVNGTKTTTQTIDGK (SEQ 24 


97-6 #9/CRM 19V (77k) 


CYFHATNSSEANAPKFS (SEQ 25) 


97-6#15/CRM i97 (77K) 


CNPENLSENFKN (SEQ 26) 


97-6 #20/CRM 197 (77k) 


CSGQGNNN (SEQ 27) 


97-7 #l/CRM l97 (79k) 
97-7 #6/CRM 197 (79k) 
97-7#ll/CRM l97 (79k) 
97-7#14/CRM 197 (79k) 
97-7#17/CRM I97 (79k) 


CVMKNNNNVNEKLAGFGKEEVM (SEQ 28) 
CKAKNGSSSSSNGGNGSS (SEQ 29) 
CTTTYNNNKATVKFDIT (SEQ 30) 
CLVRSTNNENTPGGGQ (SEQ 31) 
CRQTADINGGVYQF (SEQ 32) 
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Table 6. Concentration of 77 and 79kDa peptide conjugates. Concentration was 
determined by averaging the values determined by amino acid analysis and BCA. 



Conjugate 


Concentration (mg/ml) 


97-6 #3/CRM I97 (77kDa) 


3.5 


97-6 #7/CRM I97 (77kDa) 


2.6 


97-6 #9/CRM 197 (77kDa) 


3.8 


97-6 #15/CRM I97 (77kDa) 


2.5 


97-6 #20/CRM 197 (77kDa) 


3.2 


97-7 #1 /CRM 197 (79kDa) 


1.8 


97-7 #6/CRM I97 (79kDa) 


0.9 


97-7 #1 1/CRM 197 (79kDa) 


3.3 


97-14 #14/CRM 197 (79kDa) 


3.3 


97-14 #17/CRM 197 (79kDa) 


3.7 
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Table 7, Degree of conjugation for each peptide conjugate based on amino acid 
analysis. 



Conjugate 


Degree of Conjugation 
(moles peptide per mole CRM 197 ) 


y/'O ttj/K^ssjyl ]97 ^/ /KJJaj 


1 1 


y /-o ft //ckm^ ^ / /KDaj 


9 


97-6 #9/CRM I97 (77kDa) 


17 


97-6#15/CRM 197 (77kDa) 


8 


97-6 #20/CRM l97 (77kDa) 


18 


97-7#l/CRM 197 (79kDa) 


5 


97-7 #6/CRM 197 (79kDa) 


12 


97-7#ll/CRM I97 (79kDa) 


14 


97-14 #14/CRM I97 (79kDa) 


13 


97-14 #17/CRM,„ (79kDa) 


17 
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Table 8. Degree of conjugation of 77k and 79kDa peptide conjugates. 
The mass of the conjugate is approximate due to the irregular shape of 
the peak (broad and sometimes rough) possibly representing a 
distribution of conjugates containing different amounts of peptides 
5 attached. ND = not determined due to precipitate. 



Conjugate 


(Approximate) 


Mass of Peptide 


Degree of 
Conjugation 
(# peptides per 
CRM 197 ) 


97-6 #3/CRM ]97 (77kDa) 


79,555 


2,013 


9.1 


97-6 #7/CRM 197 (77kDa) 


74,121 


1,836 


7.0 


97-6 #9/CRM 197 (77kDa) 


86,910 


1,872 


13,7 


97-6 #15/CRM l97 (77kDa) 


68,870 


1,407 


5.4 


97-6 #20/CRM l97 (77kDa) 


72,036 


792 


13.7 


97-7#l/CRM 197 (79kDa) 


ND 


2,466 


ND 


97-7 #6/CRM 197 (79kDa) 


ND 


1,627 


ND 


97-7 #1 1/CRM 197 (79kDa) 


ND 


1,932 


ND 


97-14 #14/CRM I97 
(79kDa) 


79,231 


1,645 


11.1 


97-14 #17/CRM 197 
(79kDa) 


ND 


1,570 


ND 
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Table 9. 77k and 79kDa peptide conjugate animal study protocol. Swiss Webster mice were 
used for entire study. Injection volume = 100 jal; B = bleed; V = vaccinate. 



Conjugate 


Dose fig 


Adjuvant 


WkO 


Wk3 


Wk6 


Wk8 


97-6 #3/CRM 197 (77kDa) 


10 


20 jig QS-21 


B, V 


V 


B, V 


B 


97-6 #7/CRM 197 (77kDa) 


10 


20 jig QS-21 


B,V 


V 


B, V 


B 


97-6 #9/CRM 197 (77kDa) 


ID 


20 fig QS-21 


D, V 


V 


£5, V 


R 
o 


97-6#15/CRM 197 
(77kDa) 


10 


20 jig QS-21 


B, V 


V 


B, V 


B 


97-6 #20/CRM 197 
(77kDa) 


10 


20 fig QS-21 


B, V 


V 


B> V 


B 


97-7#l/CRM ]97 (79kDa) 


10 


20 jig QS-21 


B,V 


V 


B, V 


B 


97-7 #6/CRM 197 (79kDa) 


10 


20 fig QS-21 


B,V 


V 


B, V 


B 


97-7#ll/CRM l97 
(79kDa) 


10 


20 jig QS-21 


B, V 


V 


B, V 


B 


97-14 #14/CRM I97 
(79kDa) 


10 


20 jig QS-21 


B, V 


V 


B, V 


B 


97-14 #17/CRM 197 
(79kDa) 


10 


20 fig QS-21 


B, V 


V 


B, V 


B 
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Table 10. Weeks 0 and 6 peptide ELISA endpoint titers of 77 and 79kDa peptide 
conjugate antisera against homologous peptides 



Conjugate 


Week 0 Titer 


Week 6 Titer 


97-6 #3/CRM 197 (77Da) 


478 


1,397,000 


97-6 #7/CRM 197 (77kDa) 


375 


513,400 


97-6 #9/CRM 19? (77kDa) 


324 


967,900 


97-6#15/CRM l97 (77kDa) 


504 


1,066,000 


97-6 #20/CRM I97 (77kDa) 


511 


105,600 


97-7#l/CRM 197 (79kDa) 


369 


> 2,187,000 


97-7 #6/CRM 197 (79kDa) 


468 


120,400 


97-7#ll/CRM 197 (79kDa) 


492 


1,438,000 


97-14 #14/CRM 197 (79kDa) 


261 


523,000 


97-14 #17/CRM 19V (79kDa) 


2,242 


1,644,000 



Example 15 

Prophylactic and Therapeutic Mouse Models 
with 75/77 kDa Proteins 

10 

15.1 Prophylactic Mouse Model. 

C57BL/6 mice were vaccinated intragastrically with 100 u.g native 75.77 kDa proteins 
admixed with 10 u,g CT, 10 |ng CT-E29H (an attenuated cholera toxin mutant as 

1 5 disclosed in U.S. Provisional Application No. 60/102,430 which is hereby incorporated 
herein), 100 u,g CpG20mer (a CpG oligonucleotide sequence, see Davis et al., 1998. J. 
Immunol. 160:870-876), or 10 |xg CT-E29H + 100 jig CpG20mer on days 0, 2,14, and 
16. Mice were challenged intragastrically with 2 x 10** cfu H. pylori strain SSI on day 
31. Organisms recovered on days 58 and 59 are expressed as logio c ^ u P er g ram °f 

20 stomach tissue +/- 1 standard error of the mean. The number of colony forming units 
recovered was reduced by approximately 1-2 logs in mice vaccinated with 75/77 kDa 
protein admixed with CT, CT-E29H, and with CT-E29H + CpG20mer (Table 13). 
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15.1(a) Immunogenicity of native 75/77 kDa proteins. 

Pre-challenge pooled sera (day 30) from mice vaccinated as above were analyzed for the 
presence of antibodies to the native 75/77 kDa proteins by standard ELISA (Table 1 1). 
5 Samples containing mucosal antibodies were also screened for specific antibodies to the 
75/77 kDa proteins (Table 12). 

15.2 Therapeutic Mouse Model. 

C57BL/6 mice were infected intragastrically with 2x10** cfu H. pylori strain SSI on 
1 0 day 0. For intragastric vaccinations, mice received native 100 fig 75/77 kDa proteins, 
recombinant urease, or KLH admixed with CT on days 3 1, 33, 45, and 47. For 
subcutaneous vaccinations, mice were injected with 10 mg native 75/77 proteins 
admixed with AIPO4 on days 32 and 60. A challenged but unvaccinated control group 
was included to assess the role of nonspecific clearance from CT in the KLH/CT group. 
1 5 Recovery of organisms 2 and 4 wks after the last vaccination is expressed in log 1 0 cfu 
per gram of stomach tissue +/- 1 standard deviation. The number of colony forming 
units recovered was significantly reduced at the 2 wk time point for mice receiving 
native 75/77 kDa proteins intragastrically admixed with CT, and at the 4 wk time point 
for mice receiving native 75/77 kDa proteins subcutaneously admixed with AIPO4 
20 (Figure 14). 

Example 16 

16.1 Vaccine Characterization of Recombinant Proteins 75, 77 and 79. 

The above protocols for the animal studies are repeated for each of the proteins to 
25 establish the specific attributes by each protein as a vaccine. 
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We claim: 

1 . An isolated, substantially purified polypeptide comprising a 
5 polypeptide ofH. pylori selected from the group consisting of: 

(i) a polypeptide having a molecular weight of about 75 kDa; 

(ii) a polypeptide having a molecular weight of about 77 kDa; and 

10 

(iii) a polypeptide having a molecular weight of about 79 kDa; 

wherein each polypeptide has the amino-terminal amino acid sequence 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7). 

15 

2. The purified polypeptide of claim 1 wherein the polypeptide of K 
pylori is a polypeptide having a molecular weight of about 75 kDa. 

3. The purified polypeptide of claim 1 wherein the polypeptide of H. 
20 pylori is a polypeptide having a molecular weight of about 77 kDa. 

4. The purified polypeptide of claim 1 wherein the polypeptide of H. 
pylori is a polypeptide having a molecular weight of about 79 kDa. 

25 5. An isolated polypeptide of H. pylori selected from the group 

consisting of: 

(i) a polypeptide having a molecular weight of about 75 kDa and having 
the amino acid sequence of SEQ. ID NO. 3 or the amino acid sequence of SEQ. ID 
30 NO.19; 
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(ii) a polypeptide having a molecular weight of about 77 kDa and having 
the amino acid sequence of SEQ. ID NO.2; and 

(iii) a polypeptide having a molecular weight of about 79 kDa and having 
5 the amino acid sequence of SEQ. ID NO.3 or the amino acid sequence of SEQ. ID NO. 

20; 

6. The isolated polypeptide of claim 5 wherein the polypeptide has a 
molecular weight of about 75 kDa. 

10 

7. The isolated polypeptide of claim 5 wherein the polypeptide has a 
molecular weight of about 77 kDa. 

8. The isolated polypeptide of claim 5 wherein the polypeptide has a 
1 5 molecular weight of about 79 kDa. 

9. An antigenic composition of a substantially purified polypeptide 
comprising an isolated, substantially purified polypeptide of H. pylori selected from 
the group consisting of: 

20 

(i) a polypeptide having a molecular weight of about 75 kDa; 

(ii) a polypeptide having a molecular weight of about 77 kDa; and 

25 (iii) a polypeptide having a molecular weight of about 79 kDa; 

wherein each polypeptide has the amino-terminal amino acid sequence of 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7) and 

30 (b) one or more of a pharmaceutical ly acceptable buffer, or diluent, adjuvant 

or carrier. 
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1 0. The antigenic composition of claim 9 wherein the carrier is conjugated 
to said polypeptide. 

5 11. The antigenic composition of claim 9 wherein said adjuvant is a 

mucosal adjuvant. 

12. The antigenic composition of claim 9 wherein said adjuvant comprises 

a liquid. 

10 

13. A vaccine comprising an antigenic composition comprising an 
immunologically effective amount of an isolated, substantially purified polypeptide of 
H. pylori selected from the group consisting of: 

15 (i) a polypeptide having a molecular weight of about 75 kDa; 

(ii) a polypeptide having a molecular weight of about 77 kDa; and 

(iii) a polypeptide having a molecular weight of about 79 kDa; 

20 

wherein each polypeptide has the amino-terminal amino acid sequence 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7); and 

(b) one or more of a pharmaceutically acceptable buffer , diluent, adjuvant 
25 or carrier. 

14. A method of inducing an immune response in a mammal which 
comprises administering to said mammal an immunologically effective amount of the 
antigenic composition of claim 9. 

30 
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15. An isolated and purified nucleic acid sequence comprising a 
nucleotide sequence which hybridizes under high stringency Southern hybridization 
conditions with a nucleic acid sequence encoding a polypeptide selected from the 
group consisting of: 

5 

(i) a polypeptide having a molecular weight of about 75 kDa; 

(ii) a polypeptide having a molecular weight of about 77 kDa; and 

1 0 (iii) a polypeptide having a molecular weight of about 79 kDa; 

wherein each polypeptide has the ami no-terminal amino acid sequence 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7). 

1 5 16. The isolated and purified nucleic acid sequence of claim 1 5, wherein 

said nucleotide sequence hybridizes under high stringency southern hybridization 
conditions with a nucleic acid having the nucleotide sequence of nucleotides 58-2124 
of SEQ ID No.: 4 or the nucleotide sequence of nucleotides 1-2070 of SEQ ID No.: 21 

20 17. The isolated and purified nucleic acid sequence of claim 16, wherein 

said nucleic acid sequence has the nucleotide sequence of nucleotides 58-2124 of SEQ 
ID No. 4 or the nucleotide sequence of nucleotides 1-2070 of SEQ ID No.: 21 . 

18. An isolated and purified nucleic acid sequence comprising a 

25 nucleotide sequence encoding the polypeptide having the amino acid sequence of 

amino acids 20-708 of SEQ ID No. 1, the amino acid sequence of amino acids 1-689 
of SEQ ID No. 19, or biologically equivalent amino acid sequence thereof. 

19. The isolated and purified nucleic acid sequence of claim 15, wherein 
30 said nucleotide sequence hybridizes under high stringency southern hybridization 
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conditions with a nucleic acid having the nucleotide sequence of nucleotides 61-2223 
ofSEQ ID No.: 5. 

20. The isolated and purified nucleic acid sequence of claim 19, wherein 
5 said nucleotide sequence hybridizes under high stringency southern hybridization 

conditions with a nucleic acid having the nucleotide sequence of nucleotides 61-2223 
ofSEQ ID No.: 5. 

21. An isolated and purified nucleic acid sequence comprising a 

1 0 nucleotide sequence encoding the polypeptide having the amino acid sequence of 

amino acids 21-741 of SEQ ID No. 2, or biologically equivalent amino acid sequence 
thereof. 

22. The isolated and purified nucleic acid sequence of claim 15, wherein 
1 5 said nucleotide sequence hybridizes under high stringency southern hybridization 

conditions with a nucleic acid having the nucleotide sequence of nucleotides 61-2235 
of SEQ ID No.: 6 or the nucleotide sequence of nucleotides 1-2157 of SEQ ID No.: 22. 

23. The isolated and purified nucleic acid sequence of claim 22, wherein 
20 said nucleic acid sequence has the nucleotide sequence of nucleotides 61-2235 of SEQ 

ID No.: 6 the nucleotide sequence of nucleotides 1-2157 of SEQ ID No.: 22. 

24. An isolated and purified nucleic acid sequence comprising a 
nucleotide sequence encoding the polypeptide having the amino acid sequence of 

25 amino acids 21-745 of SEQ ID No. 3, or biologically equivalent amino acid sequence 
thereof or the amino acid sequence of amino acids 1 -7 1 8 of SEQ ID No. 20. 

25. A plasmid containing an isolated and purified nucleic acid sequence 
comprising a nucleotide sequence which hybridizes under high stringency Southern 

30 hybridization conditions with a nucleic acid sequence encoding a polypeptide selected 
from the group consisting of: 
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(0 



a polypeptide having a molecular weight of about 75 kDa; 



(ii) 



a polypeptide having a molecular weight of about 77 kDa; and 



5 



(iii) a polypeptide having a molecular weight of about 79 kDa; 

wherein each polypeptide has the amino-terminai amino acid sequence 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7). 

10 

26. The plasmid of claim 25 wherein the plasmid contains an isolated and 
purified nucleic acid sequence, wherein said nucleotide sequence hybridizes under 
high stringency southern hybridization conditions with a nucleic acid having the 
nucleotide sequence of nucleotides 58-2124 of SEQ ID No.: 4 or the nucleotide 

1 5 sequence of nucleotides 1 -2070 of SEQ ID No.: 2 1 . 

27. The plasmid of claim 26 wherein the plasmid contains an isolated and 
purified nucleic acid sequence, wherein said nucleic acid sequence has the nucleotide 
sequence of nucleotides 58-2124 of SEQ ID No. 4 the nucleotide sequence of 

20 nucleotides 1-2070 of SEQ ID No.: 21. 

28. The plasmid of claim 25 wherein the plasmid contains an isolated and 
purified nucleic acid sequence comprising a nucleotide sequence encoding the 
polypeptide having the amino acid sequence of amino acids 20-708 of SEQ ID No. 1, 

25 the amino acid sequence of amino acids 1-689 of SEQ ID No. 19, or biologically 
equivalent amino acid sequence thereof. 

29. The plasmid of claim 25 wherein the plasmid contains an isolated and 
purified nucleic acid sequence, wherein said nucleotide sequence hybridizes under 

30 high stringency southern hybridization conditions with a nucleic acid having the 
nucleotide sequence of nucleotides 6 1 -2223 of SEQ ID No. : 5 . 
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30. The plasmid of claim 29 wherein the plasmid contains an isolated and 
purified nucleic acid sequence, wherein said nucleotide sequence has the nucleotide 
sequence of nucleotides 61-2223 of SEQ ID No.: 5. 

5 

31. The plasmid of claim 25 wherein the plasmid contains an isolated and 
purified nucleic acid sequence comprising a nucleotide sequence encoding the 
polypeptide having the amino acid sequence of amino acids 21-741 of SEQ ID No. 2, 
or biologically equivalent amino acid sequence thereof. 

10 

32. The plasmid of claim 25 wherein the plasmid contains an isolated and 
purified nucleic acid sequence, wherein said nucleotide sequence hybridizes under 
high stringency southern hybridization conditions with a nucleic acid having the 
nucleotide sequence of nucleotides 61-2235 of SEQ ID No.: 6 or the nucleotide 

1 5 sequence of nucleotides 1-21 57 of SEQ ID No.: 22. 

33. The plasmid of claim 32 wherein the plasmid contains an isolated and 
purified nucleic acid sequence of claim 22, wherein said nucleic acid sequence has the 
nucleotide sequence of nucleotides 61-2235 of SEQ ID No.: 6 or the nucleotide 

20 sequence of nucleotides 1-2157 of SEQ ID No.: 22. 

34. The plasmid of claim 25 wherein the plasmid contains an isolated and 
purified nucleic acid sequence comprising a nucleotide sequence encoding the 
polypeptide having the amino acid sequence of amino acids 21-745 of SEQ ID No. 3, 

25 the amino acid sequence of amino acids 1-718 of SEQ ID No. 20 or biologically 
equivalent amino acid sequence thereof. 

35. The method of claim 14 wherein said composition is administered 
parenterally. 

30 
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36. 



The method of claim 14 wherein said composition is administered 



orally. 



37. 



The method of claim 1 wherein the polypeptides are co-purified. 



5 



38. The plasmid of claim 25 wherein the plasm id is a high copy number 
plasmid generating at least about 100 copies per cell. 

39. The plasmid of claim 25 wherein the plasmid is a high copy number 
1 0 plasmid generating from about 100 to 1,000 copies per cell. 

40. The plasmid of claim 25 wherein the plasmid is a high copy number 
plasmid generating from about 100 to 700 copies per cell. 

15 41. The plasmid of claim 40 wherein a nucleic acid sequence encoding a 

selected polypeptide is operably linked to a strong promoter. 

42. The plasmid of claim 40 wherein the promoter is selected from the 
group consisting of T7 promoter, arabinose, lambda phage promoters, tac and trc 

20 promoters. 

43. The plasmid of claim 25 wherein the nucleic acid sequence encoding 
selected polypeptide is expressed in at least about 5 to about 50% of the cellular 
protein of a host cell 



44. The plasmid of claim 25 wherein the nucleic acid sequence encoding 
selected polypeptide is expressed in at least about 10 to about 40% of the cellular 
protein of a host cell 
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45. The plasmid of claim 25 wherein the nucleic acid sequence encoding a 
selected polypeptide is expressed in at least about 10 to about 30% of the cellular 
protein of a host cell 



a mature portion the desired polypeptide 

47. A host cell transformed with at least one plasmid of claims 25-34 and 
claims 38-46-. 

10 

48. A method of producing an H. pylori polypeptide which comprises 
transforming or transfecting a host cell with at least one the plasmid of claims 25-34 
and claims 38-46, and then culturing the host cell under conditions which permit the 
expression of said polypeptide by the host cell. 



5 



46. 



The plasmid of claim 43 wherein the nucleotide sequence encodes the 



15 



49. 



The host cell of claim 47 wherein the cell is a bacterial cell. 



50. 



The method of claim 48 wherein the host cell is a bacterial cell. 



20 



51. A polyclonal antibody which preferentially binds to one polypeptide 
of H. pylori polypeptides selected from the group consisting of: 



25 



(i) a polypeptide having a molecular weight of 75 kDa; 

(ii) a polypeptide having a molecular weight of about 77 kDa; and 

(iii) a polypeptide having a molecular weight of about 79 kDa; 



wherein each polypeptide has the amino-terminal amino acid sequence 
EDDGFYTSVGYQIGEAAQMV (SEQ ID No.: 7) 
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52. A method of inducing an immune response in a mammal which 
comprises administering to said mammal an immunologically effective amount of an 
antibody by which binds to a polypeptide of Claim 1 . 

5 53. The method of claim 14 wherein the polypeptide of the antigenic 

composition is recombinantly produced. 

54. A method for producing a desired H. pylori protein comprising: (a) 
transforming a selected host cell with at least one high copy number plasm id, which 

1 0 comprises the K pylori nucleotide sequence of interest operably linked to a strong 
promoter, and (b) growing the transformed host cell in culture media. 

55. A method of purifying recombinant H. pylori polypeptides which are 
expressed as inclusion bodies in host cells comprising (a) lysing the host cells and 

1 5 isolating the inclusion bodies by removing soluble proteins; (b) solubilizing the 
inclusion bodies in a zwitterionic detergent; and (c) purifying the solubilized 
inclusion bodies. 

56. The method of claim wherein at step (c) the solubilized inclusion 
20 body material is purified using a cationic exchange gel chromatograph. 
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GAAGACGACGGCTTTTACACAAGCGTAGGCTATCAAATCGGTGAAGCCGCTCAAAT 

GGTGAAAAACACCAAAGGCATCCAAGAGCTTTCAGACAATTATGAAAAGCTGAACAA 

TCTTTTGAATAATTACAGCACCCTAAACACCCTGATCAAATTATCCGCTGATCCGAG 

CGCAATTAACGACGCAAGGGATAATCTAGGCTCAAGCTCTAGGAATTTGCTTGATGT 

C AAAACC AAT TCCCCGGCG T ATC AAGCGG TGCTTTTAGCACTGAATGC TGC AG TGG 

GGTTGTGGCAAGTTACAAGCTATGCTTTTACTGCTTGTGGTCCTGGCAGTAACGAG 

AACGCGAATGGAGGTATCCAAACTTTTAATAATGTGCCAGGACAAAATACGACGAC 

CATCACTTGCAATTCGTATTATGAGCCAGGACATGGCGGGCCTATATCCACTGCAA 

ACTATGCGAAAATCAATCAAGCCTATCAAATCATTCAAAAGGCTTTGACAGCCAATG 

GAGCTAATGGAGATGGGGTCCCTGTTTTAAGCAACACCACTACAAAACTTGATTTCA 

CTATCAATGGAGACAAAAGAACGGGGGGCAAACCAAATAAACCATTAATATACCAAT 

GGAGTCATGGGAAAGCTATTTCAACCTCGTGGAATGACACACAAGCAACACCAACA 

ACAGAAAACATCAACACAGAAAATAACGCTCAAGAGCTTTTAAAGCAAGCGAGCATC 

ATTATCACTACCCTAAATGAGGCATGCCCAAACTTTCAGAATGGTGGTAGCGGTTAT 

TGGCAAGGGATAAGTGGCAATGGGACAATGTGTGGGATGTTTMGAATGAAATCAG 

TGCTATCCAAGGCATGATCGCTAACGCGCAAGAAGCTGTCGCGCAAAGTAAAATCG 

TTAGTGAAAACGCGCAAAATCAAAACAACTTGGACACTGGAAAACCATTCACCCCT 

ACACAGACGCCAGCTTCGCTCAAAGCATGCTCAAAAACGCTCAAGCCCAAGCAGAG 

ATT T T AAACCAAGCCG AACAAG TGG TG AAAAAC T TTG AAAAAATCCC T AG AAAT T T TG 

T ATCAG ACTCT T T AGGGG TG TG T T ATG AAG TGCAAGGGGG TG AGCGT AGGGGCAC 

CAATCCAGGTCAGGTAACTTCTAACACTTGGGGGGCCGGTTGCGCCTATGTGGGG 

CAAACCATAACAAATCTAAAAAACAGCATCGCTCACTTTGGCACTCAAGAGCAGCAA 

ATACAACGAGCCGAAAATATCGCTGACACTCTAGTGAATTTCAAATCTAGATACAGC 

GAATTGGGGAATACTTACAACAGCATCACCACTGCGCTCTCCAAAGTCCCTAACGC 

GCAAAGCTTGCAAAACGTGGTGAGCAAAAAGAATAACCCCTATAGCCCGCAAGGCA 

TAGAAACCAATTACTACTTGAATCAAAATTCTTACAACCAAATCCAAACCATCAACCA 

AGAATTAGGGCGTAACCCTTTTAGGAAAGTGGGCATCGTCGGCTCTCAAACCAACA 

ACGGCGCCATGAATGGGATCGGTATTCAGGTGGGCTACAAGCAATTCTTTGGCCAA 

AAAAGAAAATGGGGCGCTAGGTATTACGGCTTTTTTGATTACAACCATGCGTTTATT 

AAATCCAGCTTCTTCAACTCGGCTTCTGATGTGTGGACTTATGGTTTTGGAGCGGAC 

GCTCTCTATAACTTCATCAACGATAAAGCCACTAACTTTTTAGGCAAAAACAACAAG 

CTTTCTGTGGGGCTTTTTGGCGGGATTGCGTTAGCGGGCACTTCATGGCTTAATTC 

TGAGTATGTGAATTTAGCCACCGTGAATAATGTCTATAACGCTAAAATGAACGTGGC 

GAACTTCCAATTCTTATTCAACATGGGAGTGAGGATGAATTTGGCCAGGCCCAAGA 

AAAAAGACAGCGATCATGCGGCTCAGCATGGGATTGAGTTAGGGCTTAAAATCCCC 

ACCATCAACACGAACTACTATTCCTTTATGGGGGCTGAACTCAAATACAGAAGGCTT 

TATAGCGTGTATTTGAATTATGTGTTCGCTTACTAA 
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EDDGFYTSVGYQIGEAAQMVKNTKGIQELSDNYEKLNNLLNNYSTLNTLIKLSADPSAIN 
DARDNLGSSSRNLLDVKTNSPAYQAVLLALNAAVGLWQVTSYAFTACGPGSNENANG 
G I QTFNNVPGQNTTT I TCNSYYEPGHGGP I STANYAK I NQAYQI I QKAL T ANG ANGDGVP 
VLSNTTTKLDFT I NGDKRTGGKPNKPL I YQWSHGKA I STSWNDTQATPTTEN I NTENNA 
QELLKQASI I ITTLNEACPNFQNGGSGYWQG ISGNGTMCGMFKNE ISAIQGMIANAQEA 
VAQSK I VSENAQNQNNLDTGKPFNPYTDASFAQSMLKNAQAQAE I LNQAEQVVKNFEK I 
PRNFVSDSLGVCYEVQGGERRGTNPGQVTSNTWGAGCAYVGQT I TNLKNS I AHFGTQ 
EQQIQRAENIADTLVNFKSRYSELGNTYNSITTALSKVPNAQSLQNWSKKNNPYSPQGI 
ETNYYLNQNSYNQIQT I NQELGRNPFRKVG I VGSQTNNGAMNG IGIQVGYKQFFGQKR 
KWGARYYGFFDYNHAF I KSSFFNSASDVWTYGFGADAL YNF I NDKATNFLGKNNKLSV 
GLFGGIALAGTSWLNSEYVNLATVNNVYNAKMNVANFQFLFNMGVRMNLARPKKKDS 

DHAAQHG IELGLK I PT I NTN YYSFMG AE L K YRRL YSVYL NYVF AY 

FIG.7 



EDDGFYTSVGYQIGEAAQMVTNTKG IQELSDNYEKLNNLLNSYSTLNTL IKLSADPSAVS 
GAINNLNAGATGLLKEKTNSPAYQAVSLALNAAVGLWNTIGYAVMCGNGNGTSNGPGS 

V I FNGEPGQGSTQ I TCNRFESTGPGHSMS I DEFKKLNQAYQ 1 1 QQALKGNGFPELGNNG 
TSVSVEYKYECKQNNNDIDGGVRQFCQAKNGSSSGSNGSSGNSKQTTTQDGVT I TTT 
NSSNE ATVNFS I TNNAEQLLNQAAN I MQVLNTQCPLVRSTNDENAPGGGQPWGLSTTG 
NACQ I FQQEFSQVTSM I KNAQE I VAQSQ I ANANQKAE I TNPSNFNPF TDAGF AQNMLKN 
ARAQAEMFNLAEQVKKNLEVMQNNNNVNAELSGFGDGMTNFVSAFLASCKNDGTLPN 

QGVTSNTWGAGCAYVGET I TALNNS I AHFGTQAEQ I EQAEN 1 ADTLVHFKSRYNE LGNT 
YNS I TT ALSKVPNAQSLQNVMGKKNNPYSPQG I ETNYYLNQNSYNQIQT I NQELGRNPF 
RKVG I VSSQTNNGAMNGIG IQVGYKQFFGQKRKWGARYYGFFDYNHAF IKSSFFNSAS 
DVWTYGFGVDALYNFINDKATNFLGKNNKLSVGLFGGIALAGTSWLNSEYVNLATMNN 
VYNAKMDVANFQF LF NMGVRMNL ARPKKKGSDHAAQHG I ELGLK I PT I NTNYYSFMGA 
ELKYRRLYSVYLNYVFAY 

FIG.9 
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GAAGACGACGGCTTTTACACAAGCGTAGGCTATCAAATCGGTAAGCCGCTCAAAT 

GG T AACAAAC ACCAAAGGCATCCAAGAGC T T TCAG ACAATT ATG AAAAGCTG AACAA 

TCTTTTGAATAGTTACAGCACCCTAAACACCCTTATCAAATTGTCCGCTGATCCGAG 

CGCGG TC AG TGGCGCG ATCAAC AAT T TG AACGCGGGCGCG ACGGG T T TGCTCAAA 

G AAAAAACCAACTCCCCCGCCT ATCAAGCCG TCTCT T TGGCG T T AAACGCCGCAG T 

GGGCTTGTGGAATACCATCGGCTATGCGGTCATGTGCGGGAACGGGAACGGCACA 

TCGAATGGGCCTGGCAGCGTGATCTTTAATGGAGAGCCAGGACAGGGCTCCACGC 

AGATTACTTGCAACCGCTTTGAATCAACTGGTCCTGGTCACAGCATGTCTATTGATG 

AATTCAAGAAACTCAATCAAGCCTATCAAATCATCCAACAAGCTTTAAAAGGAAATG 

GGTTTCCTGAACTAGGCAACAACGGCACAAGCGTGAGTGTTGAATATAAATATGAAT 

GCAAACAGAATAATAATGATATCGACGGCGGTGTGAGACAATTCTGCCAAGCGAAA 

AATGGTAGTAGCAGTGGTAGTAATGGCAGTAGTGGTAATAGCAAGCAAACAACCAC 

GCAAG ACGGCG T AACGATCACC ACT ACT AAT AG T AG T AACG AAGCC ACCG TCAAT T 

TTAGCATCACCAATAACGCTGAACAGCTGTTAAATCAAGCGGCAAACATCATGCAAG 

TCCTTAACACGCAATGCCCTTTGGTGCGATCTACGAATGATGAAAACGCTCCAGGG 

GGTGGCCAACCATGGGGTTTAAGCACGACAGGGAATGCGTGCCAAATCTTCCAACA 

AGAATTTAGCCAGGTTACTAGCATGATCAAAAACGCCCAAGAAATCGTCGCGCAAA 

GCCAAATCGCTAACGCTAACCAAAAAGCAGAAATAACTAACCCCAGTAACTTCAACC 

CTTTCACGGACGCTGGCTTTGCACAAAACATGCTCAAAAACGCTAGAGCGCAAGCA 

G AG ATG TTCAAT T T AGCTG AGCAAG TG AAAAAG AAT T TGGAAG TCATGC AAAACAAC 

AAT AACG TT AATGCGG AAT T ATC AGG AT T TGGGG ATGG AATG ACCAAT T T TGTT AGC 

GCCTTTTTGGCAAGCTGCAAAAATGATGGGACATTGCCTAATCAAGGGGTTACTTCT 

AAC AC T TGGGGGGCCGG T TGCGCG T ATG TGGG AG AG ACG AT AACGGCT T T AAAT A 

ACAGCATCGCTCACTTTGGCACTCAAGCCGAGCAGATAGAACAAGCCGAAAACATC 

GCTGACACTCTAGTGCATTTCAAATCTAGATACAATGAATTGGGCAACACTTATAAC 

AGCATCACCACCGCGCTCTCCAAAGTCCCTAACGCGCAAAGCTTGCAAAATGTGAT 

GGGAAAAAAGAATAACCCCTATAGCCCACAAGGCATAGAAACCAATTACTATCTCAA 

TCAAAACTCTTACAACCAAATCCAAACCATCAACCAAGAATTAGGCCGTAACCCCTT 

TAGGAAAGTGGGTATTGTCAGCTCTCAAACCAACAATGGCGCCATGAATGGGATCG 

GTATTCAGGTGGGCTATAAGCAATTCTTTGGCCAAAAAAGAAAATGGGGCGCTAGG 

T AT T ACGGCT T T T TTGAT T ACAACCATGCG T TCAT T AAATCCAGCT TCT TC AACTCGG 

CTTCTGACGTATGGACTTATGGTTTTGGAGTGGACGCTCTTTATAATTTCATCAACG 

ATAAAGCCACCAATTTCTTAGGCAAAAACAACAAGCTTTCTGTAGGGCTTTTTGGCG 

GGATTGCGTTAGCGGGCACTTCATGGCTCAATTCTGAATACGTGAATTTAGCCACC 

ATGAATAACGTCTATAACGCTAAAATGGATGTGGCGAACTTCCAGTTCTTATTCAAC 

ATGGG AG TGAGG ATG AAT T T AGCC AG ACCC AAG AAAAAAGGCAGCG ATCATGCGG 

CTCAGCATGGGATTGAGTTAGGGCTTAAAATCCCCACCATCAACACGAACTACTATT 

CCTTTATGGGGGCTGAACTCAAATACAGAAGGCTCTATAGCGTGTATTTGAATTATG 

TGTTCGCTTACTAA 

FIG.8 
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COMBINED DECLARATION AND POWER OF ATTORNEY 
(Original, Design, Supplemental, Divisional, Continuation, CIP) 

As the below named inventor, I hereby declare that: 



INVENTORSHIP IDENTIFICATION 

My residence, post office address and citizenship are as stated below next to my name. I believe 
I am the original, first and sole inventor (if only one name is listed below) or an original, first 
and joint inventor (if plural names are listed below) of the subject matter which is claimed and 
for which a patent is sought on the invention entitled: 

TITLE OF INVENTION 

NOVEL ANTIGENS OF HELICOBACTER PYLORI 

SPECIFICATION IDENTIFICATION 

the specification of which: (complete (a), (b), or (c)) 

(a) D is attached hereto. 

(b) CH was filed on as 

□ Serial Number 

□ Express Mail No. , as Serial Number not yet known 

(c) ^ was described and claimed in PCT International Application No. 

PCT/US99/14375 filed on June 25, 1999 and as amended under PCT Article 
19 on (if any). 



ACKNOWLEDGEMENT OF REVIEW OF PAPERS AND DUTY OF CANDOR 

I hereby state that I have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to the patentability of this 
application in accordance with Title 37 CFR 1 .56(a). 
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PRIORITY CLAIM 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 1 19 of any 
foreign application(s) for patent or inventors certificate or of any PCT International 
application(s) designating at least one country other than the United States of America listed 
below and have also identified below any foreign application(s) for patent or inventor's 
certificate of any PCT International application(s) designating at least one country other than the 
United States of America filed by me on the same subject matter having a filing date before that 
of the application(s) of which priority is claimed. 

(d) ^1 No such applications have been filed. 

(e) O Such applications have been filed as follows. 

NOTE: Where item (c) is entered above and the International Application which designated the 
U.S. claimed priority, check item (e), enter the details below and make the priority claim. 



Earliest Foreign Application(s), if any, filed within 12 months (6 months for Design) prior to this 
U.S. Application 





Application No. 


Date of Filing 
(Day, Month, Year) 


Priority Claimed 
35 USC 119 










1 Country 

























All Foreign Application(s), if any, Filed More Than 12 Months 
(6 Months for Design) Prior to This U.S. Application) 
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CLAIM FOR BENEFIT OF PRIOR U.S. PROVISIONAL APPLICATION(S) 

(35U.S.C. § 119(E)) 

I hereby claim the benefit under Title 35, United States Code, § 1 19(e) of any United States 
provisional application(s) listed below: 

PROVISIONAL APPLICATION NUMBER FILING DATE 

60/090,851 June 26, 1998 



CLAIM FOR BENEFIT OF EARLIER U.S./PCT APPLICATION(S) 
(UNDER 35 U.S.C. 120) 

I hereby claim the benefit under Title 35, United States Code, Section 120 of any United States 
application(s) designating the United States of America that is/are listed below and, insofar as 
the subject matter of each of the claims of this application is not disclosed in that/those prior 
application(s) in the manner provided by the first paragraph of Title 35, United States Code, 
Section 1 12, 1 acknowledge the duty to disclose material information as defined in Title 37, 
Code of Federal Regulations, Section 1.56(a) which occurred between the filing date of the prior 
application(s) and the national or PCT International filing date of this application. 



PRIOR U.S. APPLICATIONS OR PCT INTERNATIONAL APPLICATIONS 
DESIGNATING THE U.S. FOR BENEFIT UNDER 35 USC 120 



U.S. Applications 


Status (Check One) 


U.S. Applications 


U.S. Filing Date 


Patented 


Pending 


Abandoned 






□ 


□ 


□ 






□ 


□ 


□ 






□ 


□ 


□ 






□ 


□ 


□ 
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PCX Applications Designating U.S. 


PCT APPLICATION NO. 


PCT FILING DATE 


U.S. SERIAL NO. ASSIGNED 
(if any) 


J PCT/US99/14375 


June 25, 1999 











POWER OF ATTORNEY 

As a named inventor, I hereby appoint the following attorney(s) and/or agent(s) to prosecute this 
application and transact all business in the Patent and Trademark Office connected therewith: 

Elizabeth M. Barnhard, Reg. No. 31,088; Rebecca R. Barrett, Reg. No . 35,152; Egon E. Berg, 
Re £- No, 21,1 17; William H. CafianJ^eg. No.j^SlQ^Milagros A. Cepeda, Reg. No. 2\3£& 
Steven R. Eck, Reg. No. 36,126; Bruce M. Eiseri, Reg. No. 22,847 ; Steven H. Flynn, Reg. No. 

/ 29,629; Alan M. GordonTReg^No. 30,637; Barbara A Gyure, Reg. No. 34,614; John W. 

^£5 J Hogan, Jr., Reg. No. 32,703; Elizabeth A. Hurley, Reg. No^y*59, Adley F. Mandel, Reg, 

V^^ 7 No. 26^942; Gale F. Matthews, Reg. No. 32,269; Joseph M. Mazzarese, Reg. No. 32,803^_ 

Arnold S. Milowsky, Reg. No,,35,28_8: Daniel B. Moran, Reg. No. 41,204; Michael R. Nagy, 
Reg, No. S2>432; Barbara L. Renda, Reg. No. 27,626; M. Andrea Ryan, Reg. No. 28,469, 
Thomas S. Szatkowski, Reg. No. 18,049, George Tarnowski, Reg. No. 21,422^ and"Darryl L. 
Webster, Reg. Ng34226 all of AMERICAN HOME PRODUCTS CORPORATION, One 
Campus Drive, Parsippany, New Jersey 07054. 

PI Attached as part of this declaration and power of attorney is the authorization of the 
above-named attorney(s) to accept and follow instructions from my representative(s). 

SEND CORRESPONDENCE AND TELEPHONE CALLS TO: 
Darryl L. Webster 

American Home Products Corporation 
Patent Law Department 

^s^p^y^NLQ2Q54_. 
Tel. No. (973) 683-2159 



Page 4 of 5 



\ O 'y y s± lj S ib tji „ :i id: je a 

Docket No: ACY33427-00 
Patent 

DECLARATION 

I hereby declare that all statements herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements are 
made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code, and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 



SIGNATUREfS) 

Full name of SOLE OR FIRST INVENTOR: James Peter FULGINITI 



R FIRST INVENTOR: James 
r. United States of America 



Inventor's Signature: /j /h^> I &sJhin-y'- Date: /? -/f ^ 



Country of Citizenship: uniiea aiaies 01 America . j 

Residence: 5180 Foster Road, Cajnadajgua,.New York 14424 / 
Post Office Address: 5180 Foster Road, Canadaigua, New York 14424 

Full name of SECOND JOINT INVENTOR: Michael James FISKE 

Inventor's Signature: Ifa^X 'J. ^L^- Date: 



Country of Citizenship: United States of America 
Residence : 167 Wood Run, Roch ester, New York 14612 
Post Office Address: 167 Wood Run, Rochester, New York 14612 

Full name of THIRD JOINT INVENTOR : JDeJbo_rah Ann DILTS 

Inventor's Signature: hJ)(J\nsL^ ^(HL Date: i^tf^ftb 

Country of Citizenship: United States of America ^ \J 

Residence : 112 Country Downs Circle, Fmrgort^New York 14450 / 
Post Office Address: 112 Country Downs Circle, Fairport, New York 14450 

Full name of FOURTH JOINT INVENTOR: 



Inventor's Signature: Date: 

Country of Citizenship: 

Residence : 

Post Office Address: 
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PATENTS 



Applicant(s): 
U.S. Appl'n No. 
Int'l Appl'n No.: 
Int'l Filing Date: 
For: 



UNDER THE PATENT COOPERATION TREATY 

AMERICAN CYANAMID COMPANY Customer Number: 25291 
09/720,560 
PCT/US 99/14375 
25 JUNE 1999 

Novel Antigens of Helicobacter Pylori 

October 1 , 2001 
Notification and Request for Change of Address 



Commissioner for Patents 
Box PCT 

United States Patent and Trademark Office 
Washington, DC 20231 
Attn: John L. Anderson 

S I R: 

Please make of record in this patent application the following 
new correspondence address for all correspondence regarding the above-identified 
National Phase Application: 

Darryl L. Webster 
American Home Products Corporation 
Patent law Department 
5 Giralda Farms 
Madison, NJ 07940 

RespectfLjJi/ submitted, 
Fulqttffffiet al 




Jar^yi L. Webster 
Attorney for Applicants 
Reg. No. 34,276 



American Home Products Corporation 

Patent Law Dept. 

5 Giralda Farms 

Madison, NJ 07940 

Phone: 973-683-2159 

Facsimile: 973-683-21 17 



WO 00/00614 



PCT/US99/14375 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Fulginiti, James P. 

Fiske, Michael J. 
Dilts, Deborah A. 



(ii) TITLE OF INVENTION: Novel Antigens of Helicobacter pyl 
(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: American Cyanamid Company 

(B) STREET: One Campus Drive 
<C) CITY: Parsippany 

( D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07054 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

CD) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Webster, Darryl L. 

(B) REGISTRATION NUMBER: 34276 

(C) REFERENCE /DOCKET NUMBER: 33427-00L 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 973-683-2159 

(B) TELEFAX: 973-683-4117 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 08 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Lys Lys Thr Leu Leu Leu Ser Leu Ser Leu Ser Leu Ser Phe Leu 
1 5 10 15 
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Leu His Ala Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin lie 
20 25 30 

Gly Glu Ala Ala Gin Met Val Lys Asn Thr Lys Gly lie Gin Glu Leu 
35 40 45 

Ser Asp Asn Tyr Glu Lys Leu Asn Asn Leu Leu Asn Asn Tyr Ser Thr 
50 55 60 

Leu Asn Thr Leu lie Lys Leu Ser Ala Asp Pro Ser Ala He Asn Asp 
65 70 75 80 

Ala Arg Asp Asn Leu Gly Ser Ser Ser Arg Asn Leu Leu Asp Val Lys 
85 90 95 

Thr Asn Ser Pro Ala Tyr Gin Ala Val Leu Leu Ala Leu Asn Ala Ala 
100 105 110 

Val Gly Leu Trp Gin Val Thr Ser Tyr Ala Phe Thr Ala Cys Gly Pro 
115 120 125 

Gly Ser Asn Glu Asn Ala Asn Gly Gly He Gin Thr Phe Asn Asn Val 
130 135 140 

Pro Gly Gin Asp Thr Thr Thr He Thr Cys Asn Ser Tyr Tyr Glu Pro 
145 150 155 160 

Gly His Gly Gly Pro He Ser Thr Ala Asn Tyr Ala Lys He Asn Gin 
165 170 175 

Ala Tyr Gin He He Gin Lys Ala Leu Thr Ala Asn Gly Ala Asn Gly 
180 185 190 

Asp Gly Val Pro Val Leu Ser Asn Thr Thr Thr Lys Leu Asp Phe Thr 
195 200 205 

He Asn Gly Asp Lys Arg Thr Gly Gly Lys Pro Asn Thr Pro Glu Lys 
210 215 220 

Phe Pro Trp Ser Asp Gly Lys Tyr He His Thr Gin Trp He Asn Thr 
225 230 235 240 

He Val Thr Pro Thr Glu Thr Asn He Asn Thr Glu Asn Asn Ala Gin 
245 250 255 

Glu Leu Leu Lys Gin Ala Ser He He He Thr Thr Leu Asn Glu Ala 
260 265 270 

Cys Pro Asn Phe Gin Asn Gly Gly Arg Ser Tyr Trp Gin Gly He Ser 
275 280 285 

Gly Asn Gly Thr Met Cys Gly Met Phe Lys Asn Glu He Ser Ala He 
290 295 300 

Gin Gly Met He Ala Asn Ala Gin Glu Ala Val Ala Gin Ser Lys He 
305 310 315 320 

Val Ser Glu Asn Ala Gin Asn Gin Asn Asn Leu Asp Thr Gly Lys Pro 



2 



WO 00/00614 



PCT/US99/14375 



325 330 335 

Phe Asn Pro Tyr Thr Asp Ala Ser Phe Ala Gin Ser Met Leu Lys Asn 
340 345 350 

Ala Gin Ala Gin Ala Glu lie Leu Asn Gin Ala Glu Gin Val Val Lys 
355 360 365 

Asn Phe Glu Lys lie Pro Thr Ala Phe Val Ser Asp Ser Leu Gly Val 
370 375 380 

Cys Tyr Glu Val Gin Gly Gly Glu Arg Arg Gly Thr Asn Pro Gly Gin 
385 390 395 400 

Val Thr Ser Asn Thr Trp Gly Ala Gly Cys Ala Tyr Val Lys Gin Thr 
405 410 415 

lie Thr Asn Leu Asp Asn Ser lie Ala His Phe Gly Thr Gin Glu Gin 
420 425 430 

Gin lie Gin Gin Ala Glu Asn lie Ala Asp Thr Leu Val Asn Phe Lys 
435 440 445 

Ser Arg Tyr Ser Glu Leu Gly Asn Thr Tyr Asn Ser lie Thr Thr Ala 
450 455 460 

Leu Ser Lys Val Pro Asn Ala Gin Ser Leu Gin Asn Val Val Ser Lys 
465 470 475 480 

Lys Asn Asn Pro Tyr Ser Pro Gin Gly lie Glu Thr Asn Tyr Tyr Leu 
485 490 495 

Asn Gin Asn Ser Tyr Asn Gin lie Gin Thr lie Asn Gin Glu Leu Gly 
500 505 510 

Arg Asn Pro Phe Arg Lys Val Gly lie Val Asn Ser Gin Thr Asn Asn 
515 520 525 



Gly Ala Met Asn 
530 

Gly Gin Lys Arg 
545 

Asn His Ala Phe 



Trp Thr Tyr Gly 
580 

Lys Ala Thr Asn 

595, 

Phe Gly Gly lie 
610 

Val Asn Leu Ala 
625 



Gly He Gly lie 
535 



Lys Trp Gly Ala 
550 

He Lys Ser Ser 
565 

Phe Gly Ala Asp 



Phe Leu Gly Lys 
600 

Ala Leu Ala Gly 

615 

Thr Val Asn Asn 
630 



Gin Val Gly Tyr 
540 

Arg Tyr Tyr Gly 
555 

Phe Phe Asn Ser 
570 

Ala Leu Tyr Asn 
585 

Asn Asn Lys Leu 



Thr Ser Trp Leu 
620 

Val Tyr Asn Ala 

635 



Lys Gin Phe Phe 



Phe Phe Asp Tyr 
560 

Ala Ser Asp Val 
575 

Phe lie Asn Asp 
590 

Ser Val Gly Leu 

605 

Asn Ser Glu Tyr 



Lys Met Asn Val 
640 
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Ala Asn Phe Gin Phe Leu Phe Asn Met Gly Val Arg Met Asn Leu Ala 
645 650 655 

Arg Ser Lys Lys Lys Gly Ser Asp His Ala Ala Gin His Gly lie Glu 
660 665 670 

Leu Gly Leu Lys lie Pro Thr lie Asn Thr Asn Tyr Tyr Ser Phe Met 
675 680 685 

Gly Ala Glu Leu Lys Tyr Arg Arg Leu Tyr Ser Val Tyr Leu Asn Tyr 
690 695 700 

Val Phe Ala Tyr 
705 

{2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Lys His lie Leu Ser Leu Ala Leu Gly Ser Leu Leu Val Ser 
15 10 15 

Thr Leu Ser Ala Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin 
20 25 30 

lie Gly Glu Ala Ala Gin Met Val Thr Asn Thr Lys Gly lie Gin Asp 
35 40 45 

Leu Ser Asp Arg Tyr Glu Ser Leu Asn Asn Leu Leu Thr Arg Tyr Ser 
50 55 60 

Thr Leu Asn Thr Leu lie Lys Leu Ser Ala Asp Pro Ser Ala lie Asn 
65 70 75 80 

Ala Ala Arg Glu Asn Leu Gly Ala Ser Ala Lys Asn Leu lie Gly Asp 
85 90 95 

Lys Ala Asn Ser Pro Ala Tyr Gin Ala Val Leu Leu Ala lie Asn Ala 
100 105 110 

Ala Val Gly Phe Trp Asn Val Leu Gly Tyr Ala Thr Gin Cys Gly Gly 
115 120 125 

Asn Ala Asn Gly Gin Lys Ser Thr Ser Ser Thr Thr lie Phe Asn Asn 
130 135 140 

Glu Pro Gly Tyr Arg Ser Thr Ser lie Thr Cys Ser Leu Asn Gly Tyr 
145 150 155 160 
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Thr Pro Gly Tyr Tyr Gly Pro Met Ser lie Glu Asn Phe Lys Lys Leu 
165 170 175 

Asn Glu Ala Tyr Gin lie Leu Gin Thr Ala Leu Lys Gin Gly Leu Pro 
180 185 190 

Ala Leu Lys Glu Asn Asn Lys Lys Val Asn Val Thr Tyr Thr Tyr Thr 
195 200 205 

Cys Ser Gly Gly Gly Asn Asn Asn Cys Ser Ser Glu Ala Thr Gly Val 
210 215 220 

Ser Asn Gin Asn Gly Gly Thr Lys Thr Thr Thr Gin Thr lie Asp Gly 
225 230 235 240 

Lys Ser Val Thr Thr Thr lie Ser Ser Lys Val Val Asp Ser Thr Ala 
245 250 255 

Ser Gly Asn Thr Ser Arg Val Ser Tyr Thr Glu He Thr Asn Lys Leu 
260 265 270 

Glu Gly Val Pro Asp Ser Ala Gin Ala Leu Leu Ala Gin Ala Ser Thr 
275 280 285 

Leu He Ser Thr He Asn Thr Ala Cys Pro Phe Phe Ser Val Thr Asn 
290 295 300 

Gin Ser Gly Gly Pro Gin Met Glu Pro Thr Lys Gly Lys Leu Cys Gly 
305 310 315 320 

Phe Thr Glu Glu He Ser Ala He Gin Lys Met He Thr Asp Ala Gin 
325 330 335 

Glu Leu Val Asn Gin Thr Ser Val lie Asn Ser His Glu Gin Ser Thr 
340 345 350 

Leu Val Gly Gly Asn Asn Gly Lys Pro Phe Asn Pro Phe Thr Asp Ala 
355 360 365 

Gin Phe Ala Gin Gly Met Leu Ala Asn Ala Ser Ala Gin Ala Lys Met 
370 375 380 

Leu Asn Leu Ala His Gin Val Gly Gin Thr lie Asn Pro Asn Asn Leu 
385 390 395 400 

Thr Gly Asn Phe Lys Asn Phe Val Thr Gly Phe Leu Ala Thr Cys Asn 
405 410 415 

Asn Pro Ser Thr Ala Gly Thr Gly Gly Thr Gin Gly Ser Ala Pro Gly 
420 425 430 

Thr Val Thr Thr Gin Thr Phe Ala Ser Gly Cys Ala Tyr Val Glu Gin 
435 440 445 

Thr He Thr Asn Leu Glu Asn Ser lie Ala His Phe Gly Thr Gin Glu 
450 455 460 

Gin Gin lie Gin Arg Ala Glu Asn lie Ala Asp Thr Leu Val Asn Phe 
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465 

Lys Ser Arg Tyr 



Ala Leu Ser Lys 
500 

Lys Lys Asn Asn 
515 

Leu Asn Gin Asn 
530 

Gly Arg Asn Pro 
545 



470 

Ser Glu Leu Gly 
485 

Val Pro Asn Ala 



Pro Tyr Ser Pro 
520 



Ser Tyr Asn Gin 
535 

Phe Arg Lys Val 
550 



475 

Asn Thr Tyr Asn 
490 

Gin Ser Leu Gin 
505 

Gin Gly lie Glu 



He Gin Thr He 
540 

Gly He Val Gly 
555 



480 

Ser He Thr Thr 
495 

Asn Val Val Ser 
510 

Thr Asn Tyr Tyr 
525 

Asn Gin Glu Leu 



Ser Gin Thr Asn 
560 



Asn Gly Ala Met Asn Gly He Gly He Gin Val Gly Tyr Glu Gin Phe 
565 570 575 

Phe Gly Gin Lys Arg Lys Trp Gly Ala Arg Tyr Tyr Gly Phe Phe Asp 
580 585 590 

Tyr Asn His Ala Phe He Lys Ser Ser Phe Phe Asn Ser Ala Ser Asp 
595 600 605 

Val Trp Thr Tyr Gly Phe Gly Ala Asp Ala Leu Tyr Asn Phe He Asn 
610 615 620 

Asp Lys Ala Thr Asn Phe Leu Gly Lys Asn Asn Lys Leu Ser Val Gly 
625 630 635 640 

Leu Phe Gly Gly He Ala Leu Ala Gly Thr Ser Trp Leu Asn Ser Glu 
645 650 655 

Tyr Val Asn Leu Ala Thr Val Asn Asn Val Tyr Asn Ala Lys Met Asn 
660 665 670 

Val Ala Asn Phe Gin Phe Leu Phe Asn Met Gly Val Arg Met Asn Leu 
675 680 685 

Ala Arg Pro Lys Lys Asn Asp Ser Asp His Ala Ala Gin His Gly He 
690 695 700 

Glu Leu Gly Leu Lys He Pro Thr He Asn Thr Asn Tyr Tyr Ser Phe 
705 710 715 720 

Met Gly Ala Glu Leu Lys Tyr Arg Arg Leu Tyr Ser Val Tyr Leu Asn 
725 730 735 

Tyr Val Phe Ala Tyr 
740 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 5 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Lys Lys His lie Leu Ser Leu Ala Leu Gly Ser Leu Leu Val Ser 
1 5 10 15 

Thr Leu Ser Ala Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin 
20 25 30 

lie Gly Glu Ala Ala Gin Met Val Thr Asn Thr Lys Gly He Gin Gin 
35 40 45 

Leu Ser Asp Asn Tyr Glu Asn Leu Asn Asn Leu Leu Thr Arg Tyr Ser 
50 55 60 

Thr Leu Asn Thr Leu He Lys Leu Ser Ala Asp Pro Ser Ala He Asn 
65 70 75 80 

Ala Val Arg Glu Asn Leu Gly Ala Ser Thr Lys Asn Leu He Gly Asp 
85 90 95 

Lys Ala Asn Ser Pro Ala Tyr Gin Ala Val Phe Leu Ala He Asn Ala 
100 105 110 

Ala Val Gly Leu Trp Asn Thr He Gly Tyr Ala Val Met Cys Gly Asn 
115 120 125 

Gly Asn Gly Thr Glu Ser Gly Pro Gly Ser Val lie Phe Asn Asp Gin 
130 135 140 

Pro Gly Gin Asp Ser Thr Gin He Thr Cys Asn Arg Phe Glu Ser Thr 
145 150 155 160 

Gly Pro Gly Lys Ser Met Ser He Asp Glu Phe Lys Lys Leu Asn Glu 

165 170 175 

Ala Tyr Gin He He Gin Gin Ala Leu Lys Asn Gin Ser Gly Phe Pro 
180 185 190 

Glu Leu Gly Gly Asn Gly Thr Lys Val Ser Val Asn Tyr Asn Tyr Glu 
195 200 205 

Cys Arg Gin Thr Ala Asp He Asn Gly Gly Val Tyr Gin Phe Cys Lys 
210 215 220 

Ala Lys Asn Gly Ser Ser Ser Ser Ser Asn Gly Gly Asn Gly Ser Ser 
225 230 235 240 

Thr Gin Thr Thr Ala Thr Thr Thr Gin Asp Gly Val Thr He Thr Thr 
245 250 255 

Thr Tyr Asn Asn Asn Lys Ala Thr Val Lys Phe Asp He Thr Asn Asn 
260 265 270 
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Ala Glu Gin Leu Leu Asn Gin Ala Ala Asn lie Met Gin Val Leu Asn 
275 280 285 

Thr Gin Cys Pro Leu Val Arg Ser Thr Asn Asn Glu Asn Thr Pro Gly 
290 295 300 

Gly Gly Gin Pro Trp Gly Leu Ser Thr Ser Gly Asn Ala Cys Ser lie 
305 310 315 320 

Phe Gin Gin Glu Phe Ser Gin Val Thr Ser Met lie Lys Asn Ala Gin 
325 330 335 

Glu He He Ala Gin Ser Lys He Val Ser Glu Asn Ala Gin Asn Gin 
340 345 350 

Asn Asn Leu Asp Thr Gly Lys Pro Phe Asn Pro Tyr Thr Asp Ala Ser 
355 360 365 

Phe Ala Gin Ser Met Leu Lys Asn Ala Gin Ala Gin Ala Glu Met Phe 
370 375 380 

Asn Leu Ser Glu Gin Val Lys Lys Asn Leu Glu Val Met Lys Asn Asn 
385 390 395 400 

Asn Asn Val Asn Glu Lys Leu Ala Gly Phe Gly Lys Glu Glu Val Met 
405 410 415 

Thr Asn Phe Val Ser Ala Phe Leu Ala Ser Cys Lys Asp Gly Gly Thr 
420 425 430 

Leu Pro Asn Ala Gly Val Thr Ser Asn Thr Trp Gly Ala Gly Cys Ala 
435 440 445 

Tyr Val Gly Glu Thr He Ser Ala Leu Thr Asn Ser He Ala His Phe 
450 455 460 

Gly Thr Gin Glu Gin Gin lie Gin Gin Ala Glu Asn lie Ala Asp Thr 
465 470 475 480 

Leu Val Asn Phe Lys Ser Arg Tyr Ser Glu Leu Gly Asn Thr Tyr Asn 
485 490 495 

Ser He Thr Thr Ala Leu Ser Lys Val Pro Asn Ala Gin Ser Leu Gin 
500 505 510 

Asn Val Val Ser Lys Lys Asn Asn Pro Tyr Ser Pro Gin Gly lie Glu 
515 520 525 

Thr Asn Tyr Tyr Leu Asn Gin Asn Ser Tyr Asn Gin He Gin Thr lie 
530 535 540 

Asn Gin Glu Leu Gly Arg Asn Pro Phe Arg Lys Val Gly He Val Asn 
545 550 555 560 

Ser Gin Thr Asn Asn Gly Ala Met Asn Gly lie Gly He Gin Val Gly 
565 570 575 

Tyr Lys Gin Phe Phe Gly Gin Lys Arg Lys Trp Gly Ala Arg Tyr Tyr 
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580 

Gly Phe Phe Asp Tyr 
595 

Ser Ala Ser Asp Val 
610 

Asn Phe lie Asn Asp 
625 

Leu Ser Leu Gly Leu 
645 



Leu Asn Ser Glu Tyr 
660 

Ala Lys Met Asn Val 
675 

Arg Met Asn Leu Ala 
690 

Gin His Gly lie Glu 
705 

Tyr Tyr Ser Phe Met 
725 



585 

Asn His Ala Phe lie Lys 
600 

Trp Thr Tyr Gly Phe Gly 
615 

Lys Ala Thr Asn Phe Leu 
630 635 

Phe Gly Gly He Ala Leu 
650 

Val Asn Leu Ala Thr Val 
665 

Ala Asn Phe Gin Phe Leu 
680 

Arg Ser Lys Lys Lys Gly 
695 

Leu Gly Leu Lys He Pro 
710 715 

Gly Ala Glu Leu Lys Tyr 
730 



590 

Ser Ser Phe Phe Asn 
605 

Ala Asp Ala Leu Tyr 
620 

Gly Lys Asn Asn Lys 
640 

Ala Gly Thr Ser Trp 

655 

Asn Asn Val Tyr Asn 
670 

Phe Asn Met Gly Val 
685 

Ser Asp His Ala Ala 
700 

Thr He Asn Thr Asn 
720 



Arg Arg Leu Tyr Ser 
735 



Val Tyr Leu Asn Tyr Val Phe Ala Tyr 
740 745 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



ATGAAAAAAA 


CCCTTTTACT 


CTCTCTCTCT 


CTCTCTCTCT 


CGTTTTTGCT 


CCACGCTGAA 


60 


GACGACGGCT 


TTTACACAAG 


CGTGGGCTAT 


CAAATCGGTG 


AAGCCGCTCA 


AATGGTGAAA 


120 


AACACCAAAG 


GCATTCAAGA 


GCTTTCAGAC 


AATTATGAAA 


AGCTGAACAA 


TCTTTTGAAT 


180 


AATTACAGCA 


CCCTAAACAC 


CCTTATCAAA 


TTGTCCGCTG 


ATCCGAGCGC 


GATTAACGAC 


240 


GCAAGGGATA 


ATCTAGGCTC 


AAGCTCTAGG 


AATTTGCTTG 


ATGTCAAAAC 


CAATTCCCCC 


300 


GCGTATCAAG 


CCGTGCTTTT 


AGCACTCAAT 


GCTGCAGTGG 


GGTTGTGGCA 


AGTTACAAGC 


360 


TACGCTTTTA 


CTGCTTGTGG 


TCCTGGCAGT 


AACGAGAATG 


CGAATGGAGG 


GATCCAAACT 


420 
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TTTAATAATG 


TGCCAGGACA 


AGATACGACG 


ACCATCACTT 


GCAATTCGTA 


TTATGAGCCA 


480 


GGACATGGTG 


GGCCTATATC 


CACTGCAAAT 


TATGCGAAAA 


TCAATCAAGC 


CTATCAAATC 


540 


ATCCAAAAGG 


CTTTGACAGC 


CAATGGAGCT 


AATGGAGATG 


GGGTCCCCGT 


TTTAAGCAAC 


600 


AC C ACT AC AA 


AACTTGATTT 


CACTATCAAT 


GGAGACAAAA 


GAACGGGGGG 


CAAACCAAAT 


660 


ACACCTGAAA 


AGTTCCCATG 


GAGTGATGGG 


AAATATATTC 


ACACCCAATG 


GATTAACACA 


720 


ATAGTAACAC 


CAACAGAAAC 


AAATATCAAC 


ACAGAAAATA 


ACGCTCAAGA 


GCTTTTAAAA 


780 


CAAGCGAGCA 


TCATTATCAC 


TACCCTAAAT 


GAGGCATGCC 


CAAACTTCCA 


AAATGGTGGT 


840 


AGAAGTTATT 


GGCAAGGGAT 


AAGCGGCAAT 


GGGACAATGT 


GCGGGATGTT 


TAAGAATGAA 


900 


ATCAGCGCGA 


TCCAAGGCAT 


GATCGCTAAC 


GCTCAAGAAG 


CTGTCGCGCA 


AAGCAAAATC 


960 


GTTAGTGAAA 


ACGCGCAAAA 


TCAAAACAAC 


TTGGATACTG 


GAAAACCATT 


CAACCCTTAC 


1020 


ACGGACGCCA 


GCTTTGCGCA 


AAGCATGCTC 


AAAAACGCTC 


AAGCGCAAGC 


AGAGATTTTA 


1080 


AACCAAGCCG 


AACAAGTAGT 


AAAAAACTTT 


GAAAAAATCC 


CTACAGCCTT 


TGTATCAGAC 


1140 


TCTTTAGGGG 


TGTGTTATGA 


AGTGCAAGGG 


GGTGAGCGTA 


GGGGCACCAA 


TCCAGGTCAG 


1200 


GTAACTTCTA 


ACACTTGGGG 


AGCCGGTTGC 


GCGTATGTGA 


AACAAACCAT 


AACGAATTTA 


1260 


GACAACAGCA 


TCGCTCACTT 


TGGCACTCAA 


GAGCAGCAGA 


TACAGCAAGC 


CGAAAACATC 


1320 


GCTGACACTC 


TAGTGAATTT 


CAAATCTAGA 


TACAGCGAAT 


TAGGCAACAC 


C TAT AAC AG C 


1380 


ATCACCACCG 


CGCTCTCCAA 


AGTCCCTAAC 


GCGCAAAGCT 


TGCAAAACGT 


GGTGAGCAAA 


1440 


AAGAATAACC 


CCTATAGCCC 


TCAAGGCATA 


GAGACCAATT 


ACTACCTCAA 


TCAAAATTCT 


1500 


T AC AAC C AAA 


TCCAAACCAT 


CAACCAAGAA 


CTAGGGCGTA 


ACCCCTTTAG 


GAAAGTGGGC 


1560 


ATCGTCAATT 


CTCAAACCAA 


CAATGGTGCC 


ATGAATGGGA 


TCGGTATTCA 


GGTGGGCTAT 


1620 


AAGCAATTCT 


TTGGCCAAAA 


AAGAAAATGG 


GGCGCTAGGT 


ATTACGGCTT 


TTTTGACTAC 


1680 


AACCATGCGT 


TCATTAAATC 


CAGCTTCTTC 


AACTCGGCTT 


CTGATGTGTG 


GACTTATGGT 


1740 


TTTGGAGCGG 


ACGCTCTTTA 


TAACTTCATC 


AACGATAAAG 


CCACCAATTT 


CTTAGGCAAA 


1800 


AACAACAAGC 


TTTCCGTGGG 


GCTTTTTGGA 


GGGATTGCGT 


TAGCGGGCAC 


TTCATGGCTT 


1860 


AATTCTGAGT 


ATGTGAATTT 


AGCCACCGTG 


AATAACGTCT 


ATAACGCTAA 


AATGAATGTG 


1920 


GCGAATTTCC 


AATTCTTATT 


CAATATGGGA 


GTGAGGATGA 


ATTTAGCCAG 


ATCCAAGAAA 


1980 


AAAGGCAGCG 


ATCATGCGGC 


TCAGCATGGG 


AT TG AAC TAG 


GGCTTAAAAT 


CCCCACCATC 


2040 


- AACACGAACT 


ATTATTCTTT 


CATGGGGGCT 


GAACTCAAAT 


ACAGAAGGCT 


TTATAGCGTG 


2100 


TATTTGAATT 


ATGTGTTCGC 


TTACTAA 








2127 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2226 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATGAAAAAAC 


ACATCCTTTC 


ATTAGCTTTA 


GGCTCGCTTT 


TAGTTTCCAC 


TTTGAGCGCT 


60 


GAAGACGACG 


GCTTTTACAC 


AAGCGTGGGC 


TATCAAATCG 


GTGAAGCCGC 


TCAAATGGTA 


120 


ACAAACACCA 


AAGGCATTCA 


AGATCTTTCA 


GATCGTTATG 


AAAGTTTGAA 


CAACCTTTTG 


180 


ACCCGATACA 


GCACCCTAAA 


CACCCTGATC 


AAATTGTCCG 


CTGATCCGAG 


CGCGATTAAT 


240 


GCGGCGCGTG 


AAAATCTGGG 


CGCGAGCGCG 


AAGAATTTGA 


TCGGCGATAA 


AGCCAATTCC 


300 


CCAGCCTATC 


AAGCGGTGCT 


TTTAGCGATC 


AACGCGGCGG 


TAGGGTTTTG 


GAATGTCTTA 


360 


GGCTATGCTA 


CGCAATGCGG 


GGGTAACGCC 


AATGGTCAAA 


AAAGCACCTC 


TTCAACGACC 


420 


ATCTTCAACA 


ACGAGCCAGG 


GTATCGATCC 


ACTTCCATCA 


CTTGCTCTTT 


GAACGGGTAT 


480 


ACGCCTGGAT 


ACTATGGCCC 


TAT GAG TAT T 


GAGAATTTCA 


AAAAGCTTAA 


CGAAGCCTAT 


540 


CAGATCCTCC 


AAACGGCGTT 


AAAACAAGGC 


TTACCCGCGC 


TCAAAGAAAA 


CAACAAGAAG 


bUU 


GTCAATGTTA 


CCTACACTTA 


CACATGCTCA 


GGGGGAGGGA 


ATAATAACTG 


CTCGTCAGAA 


660 


GCCACAGGTG 


TAAGCAATCA 


AAATGGCGGA 


ACTAAAACCA 


CCACCCAAAC 


CATAGACGGC 


720 


AAAAGCGTAA 


CCACCACGAT 


CAGTTCAAAA 


GTCGTTGATA 


GCACAGCGAG 


TGGTAACACA 


780 


TCACGTGTCT 


CCTACACCGA 


AATCACCAAC 


AAATTAGAAG 


GTGTGCCTGA 


TAGCGCTCAA 


840 


GCGCTCTTAG 


CGCAAGCGAG 


CACGCTCATT 


AGCACCATCA 


ACACGGCATG 


CCCGTTTTTT 


900 


AGTGTAACTA 


ATCAAAGTGG 


TGGTCCACAG 


ATGGAACCGA 


CTAAAGGGAA 


GTTGTGCGGT 


960 


TTTACAGAAG 


AAATCAGCGC 


GATCCAAAAG 


AT GAT C AC AG 


AC GCGCAAG A 


GCTGGTCAAT 


1020 


CAAACGAGCG 


TCATTAATAG 


CCATGAACAA 


TCAACCCTAG 


TGGGCGGTAA 


TAATGGCAAG 


1080 


CCTTTCAACC 


CTTTCACGGA 


CGCTCAATTC 


GCTCAAGGCA 


TGCTCGCTAA 


CGCTAGCGCG . 


1140 


CAAGCTAAAA 


TGCTCAATTT 


AGCCCATCAA 


GTGGGGCAAA 


CCATTAACCC 


TAACAATCTT 


1200 


ACTGGGAATT 


TTAAAAATTT 


TGTTACAGGC 


TTTTTAGCCA 


CATGCAACAA 


CCCCTCAACA 


1260 


GCTGGCACTG 


GTGGCACACA 


AGGTTCAGCT 


CCAGGCACGG 


TTACCACTCA 


AACTTTCGCT 


1320 


TCCGGTTGCG 


CGTATGTGGA 


ACAAACCATA 


ACGAATTTAG 


AAAACAGCAT 


CGCGCACTTT 


1380 
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GGCACTCAAG AGCAGCAAAT ACAACGAGCC GAAAATATCG CTGACACTCT AGTGAATTTC 14 4 0 

AAATCTAGAT ACAGCGAATT GGGGAATACT T AC AAC AG C A TCACCACTGC GCTCTCCAAA 1500 

GTCCCTAACG CGCAAAGCTT GCAAAACGTG GTGAGCAAAA AGAATAACCC CTATAGCCCG 1560 

CAAGGCATAG AAACCAATTA CTACTTGAAT CAAAATTCTT ACAACCAAAT CCAAACCATC 1620 

AACCAAGAAT TAGGGCGTAA CCCTTTTAGG AAAGTGGGCA TCGTCGGCTC TCAAACCAAC 1680 

AACGGCGCCA TGAATGGGAT CGGTATTCAG GTGGGCTACG AGCAATTCTT TGGCCAAAAA 174 0 

AGAAAATGGG GCGCTAGGTA TTACGGCTTT TTTGATTACA ACCATGCGTT TATTAAATCC 1800 

AGCTTCTTCA ACTCGGCTTC TGATGTGTGG ACTTATGGTT TTGGAGCGGA CGCTCTCTAT 18 60 

AACTTCATCA ACGATAAAGC CACTAACTTT T TAG G C AAAA ACAACAAGCT TTCTGTGGGG 1920 

CTTTTTGGCG GGATTGCGTT AGCGGGCACT TCATGGCTTA ATTCTGAGTA TGTGAATTTA 1980 

GCCACCGTGA ATAATGTCTA TAACGCTAAA ATGAACGTGG CGAACTTCCA ATTCTTATTC 2 04 0 

AACATGGGAG TGAGGATGAA TTTGGCCAGG CCCAAGAAAA ACGACAGCGA TCATGCGGCT 2100 

CAGCATGGGA TTGAGTTAGG GCTTAAAATC CCCACCATCA ACACGAACTA CTATTCCTTT 2160 

ATGGGGGCTG AACTCAAATA CAGAAGGCTT TATAGCGTGT ATTTGAATTA TGTGTTCGCT 2220 

TACTAG 222 6 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGAAAAAAC ACATCCTTTC ATTAGCTTTA GGCTCGCTTT TAGTTTCCAC TTTGAGCGCT 60 

GAAGACGACG GCTTTTACAC AAGCGTAGGC TATCAGATCG GTGAAGCCGC TCAAATGGTA 120 

ACAAACACCA AAGGCATCCA ACAGCTTTCA GACAATTATG AAAATTTGAA CAACCTTTTA 180 

ACGAGATACA GCACCCTAAA CACCCTTATC AAATTGTCCG CTGATCCGAG CGCAATTAAT 24 0 

GCGGTGCGGG AAAATCTGGG CGCGAGCACG AAGAATTTGA TCGGCGATAA AGCCAACTCC 300 

CCGGCGTATC AAGCCGTGTT TTTAGCGATC AACGCGGCGG TAGGGTTGTG GAATACCATC 360 

GGCTATGCGG TCATGTGCGG GAACGGGAAC GGCACAGAGA GTGGGCCTGG CAGCGTGATC 4 20 
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TTTAATGACC 


AACCAGGACA 


GGATTCCACG 


CAAATTACTT 


GCAACCGCTT 


TGAATCAACT 


480 


GGGCCTGGTA 


AAAGCATGTC 


TATTGATGAA 


TTCAAAAAAC 


TCAATGAAGC 


CTATCAAATC 


540 


ATCCAGCAAG 


CTTTAAAAAA 


TCAAAGTGGG 


TTTCCTGAAT 


TAGGCGGGAA 


CGGCACAAAA 


600 


GTGAGTGTTA 


ATTACAATTA 


CGAATGCAGA 


CAAACTGCTG 


ATATCAACGG 


CGGTGTGTAT 


660 


CAGTTCTGCA 


AGGCTAAAAA 


TGGTAGTAGT 


AGCAGTAGTA 


ATGGCGGTAA 


TGGCAGTAGC 


720 


ACGCAAACAA 


CCGCGACAAC 


CACGCAAGAC 


GGCGTAACGA 


TCACCACTAC 


CTATAATAAT 


780 


AACAAAGCCA 


CCGTCAAATT 


TGACATCACC 


AATAACGCTG 


AACAGCTGTT 


AAATCAAGCG 


840 


GCAAACATCA 


TGCAAGTCCT 


TAATACGCAA 


TGCCCTTTAG 


TGCGTTCCAC 


GAATAACGAA 


900 


AACACTCCAG 


GGGGTGGTCA 


ACCATGGGGT 


TTAAGCACAT 


CCGGGAATGC 


GTGCAGCATC 


960 


TTCCAACAAG 


AATTTAGCCA 


GGTTACTAGC 


ATGATCAAAA 


ACGCCCAAGA 


AATAATCGCG 


1020 


CAAAGCAAAA 


TCGTTAGTGA 


AAACGCGCAA 


AATCAAAACA 


ACTTGGATAC 


TGGAAAACCA 


1080 


TTCAACCCTT 


ACACGGACGC 


CAGCTTTGCG 


CAAAGCATGC 


TCAAAAACGC 


TCAAGCGCAA 


1140 


GCAGAGATGT 


TCAATTTGAG 


CGAACAAGTG 


AAAAAGAACT 


TGGAAGTCAT 


GAAAAACAAC 


1200 


AATAATGTTA 


ACGAGAAATT 


AGCAGGATTT 


GGGAAAGAAG 


AAGTAATGAC 


CAATTTTGTT 


1260 


AGCGCCTTTT 


TGGCAAGCTG 


CAAAGATGGT 


GGCACATTGC 


CTAATGCAGG 


GGTTACTTCT 


1320 


AACACTTGGG 


GGGCGGGTTG 


CGCGTATGTG 


GGAGAGACGA 


TAAGCGCCCT 


AACCAACAGC 


1380 


ATCGCTCACT 


TTGGCACTCA 


AGAGCAGCAG 


ATACAGCAAG 


CCGAAAACAT 


CGCTGACACT 


1440 


CTAGTGAATT 


TCAAATCTAG 


AT AC AG C G AA 


TTAGGCAACA 


CCTATAACAG 


CATCACCACC 


1500 


GCGCTCTCCA 


AAGTCCCTAA 


CGCGCAAAGC 


TTGCAAAACG 


TGGTGAGCAA 


AAAGAATAAC 


1560 


CCCTATAGCC 


CTCAAGGCAT 


AGAGACCAAT 


TACTACCTCA 


ATCAAAATTC 


TTACAACCAA 


1620 


ATCCAAACCA 


TCAACCAAGA 


ACTAGGGCGT 


AACCCCTTTA 


GGAAAGTGGG 


CATCGTCAAT 


1680 


TCTCAAACCA 


ACAATGGTGC 


CATGAATGGG 


ATCGGCATTC 


AGGTGGGCTA 


TAAGCAATTC 


1740 


TTTGGCCAAA 


AAAGAAAATG 


GGGCGCTAGG 


TATTACGGCT 


TTTTTGATTA 


CAACCATGCG 


1800 


TTCATCAAAT 


CCAGCTTTTT 


CAACTCGGCT 


m m 7\ /i /~* rn f~+ rn 

TCTGACGTGT 


r~~" i\ Ti rp t\ rr\{~*f~* 

GGACTTAHjtj 


1111 bbHbtb 


i ft fin 


GACGCGCTTT 


ATAACTTCAT 


C AAC GAT AAA 


GCCACCAATT 


TCTTAGGCAA 


AAACAACAAG 


1920 


CTTTCTTTGG 


GGCTTTTTGG 


CGGGATTGCG 


TTAGCGGGCA 


CTTCATGGCT 


CAATTCTGAG . 


1980 


TACGTGAATT 


TAGCCACCGT 


GAATAACGTC 


TATAACGCTA 


AAATGAATGT 


GGCGAATTTC 


2040 


CAATTCTTAT 


TCAATATGGG 


AGTGAGGATG 


AATTTAGCCA 


GATCCAAGAA 


AAAAGGCAGC 


2100 


GATCATGCAG 


CTCAGCATGG 


GATTGAGTTA 


GGGCTTAAAA 


TCCCCACCAT 


CAACACGAAC 


2160 


TATTATTCCT 


TTATGGGGGC 


TGAACTCAAA 


TACAGAAGGC 


TCTATAGCGT 


GTATTTGAAC 


2220 
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TATGTGTTCG CTTACTAA 2238 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin lie Gly Glu Ala 
15 10 15 

Ala Gin Met Val 
20 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GARGAYGAYG GNTTYTAYAC NWSNGTNGGN TAYCARATHG GNGARGCNGC N CARAT GGTN 60 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
G G C CAT AT G A AAAAACACAT CCTTTCATTA GCTTTAGGCT CG 4 2 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

"(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGCAAGCTTG GGAGTTTCAC AAAAAGCTTA GTAAGCGAAC AC 



42 
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<12 0> NOVEL ANTIGENS OF HELICOBACTER PYLORI 



<130> 0646/89939 

<140> PCT/US99/14375 
<141> 1999-06-25 

<150> 60/090,851 
<151> 1998-06-26 

<160> 42 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 708 
<212> PRT 

<213> Helicobacter pylori 
<400> 1 



Met 


Lys 


Lys 


Thr 


Leu 


Leu 


Leu 


Ser 


Leu 


Ser 


Leu 


Ser 


Leu 


Ser 


Phe 


Leu 


1 




5 










10 










15 




Leu 


His 


Ala 


Glu 


Asp 


Asp 


Gly 


Phe 


Tyr 


Thr 


Ser 


Val 


Gly Tyr 


Gin 


He 








20 










25 










30 






Gly 


Glu 


Ala 


Ala 


Gin 


Met 


Val 


Lys 


Asn 


Thr 


Lys 


Gly 


He 


Gin 


Glu 


Leu 




35 










40 










45 








Ser 


Asp 
50 


Asn 


Tyr 


Glu 


Lys 


Leu 
55 


Asn 


Asn 


Leu 


Leu 


Asn 
60 


Asn 


Tyr 


Ser 


Thr 


Leu 


Asn 


Thr 


Leu 


He 


Lys 


Leu 


Ser 


Ala 


Asp 


Pro 


Ser 


Ala 


He 


Asn 


Asp 


65 










70 










75 








Val 


80 


Ala 


Arg 


Asp 


Asn 


Leu 


Gly 


Ser 


Ser 


Ser 


Arg 


Asn 


Leu 


Leu 


Asp 


Lys 






85 










90 










95 




Thr 


Asn 


Ser 


Pro 
100 


Ala 


Tyr 


Gin 


Ala 


Val 
105 


Leu 


Leu 


Ala 


Leu 


Asn 
110 


Ala 


Ala 


Val 


Gly 


Leu 


Trp 


Gin 


Val 


Thr 


Ser 


Tyr 


Ala 


Phe 


Thr 


Ala 


Cys 


Gly 


Pro 




115 










120 










125 






Val 


Gly 


Ser 


Asn 


Glu 


Asn 


Ala 


Asn 


Gly 


Gly 


He 


Gin 


Thr 


Phe 


Asn 


Asn 


130 










135 










140 










Pro 


Gly 


Gin 


Asp 


Thr 


Thr 


Thr 


He 


Thr 


Cys 


Asn 


Ser 


Tyr 


Tyr 


Glu 


Pro 


145 






150 










155 










160 


Gly 


His 


Gly 


Gly 


Pro 


He 


Ser 


Thr 


Ala 


Asn 


Tyr 


Ala 


Lys 


He 


Asn 


Gin 






165 










170 










175 




Ala 


Tyr 


Gin 


He 


He 


Gin 


Lys 


Ala 


Leu 


Thr 


Ala 


Asn 


Gly 


Ala 


Asn 


Gly 






180 










185 










190 






Asp 


Gly 


Val 


Pro 


Val 


Leu 


Ser 


Asn 


Thr 


Thr 


Thr 


Lys 


Leu 


Asp 


Phe 


Thr 


195 










200 










205 








He 


Asn 
210 


Gly 


Asp 


Lys 


Arg 


Thr 
215 


Gly 


Gly 


Lys 


Pro 


Asn 
220 


Thr 


Pro 


Glu 


Lys 




Phe Pro Trp Ser Asp Gly Lys Tyr lie His Thr Gin Trp He Asn Thr 
225 230 235 240 

He Val Thr Pro Thr Glu Thr Asn He Asn Thr Glu Asn Asn Ala Gin 

245 250 255 

Glu Leu Leu Lys Gin Ala Ser He lie He Thr Thr Leu Asn Glu Ala 

260 265 270 

Cys Pro Asn Phe Gin Asn Gly Gly Arg Ser Tyr Trp Gin Gly He Ser 

275 280 285 

Gly Asn Gly Thr Met Cys Gly Met Phe Lys Asn Glu He Ser Ala lie 

290 295 300 

Gin Gly Met He Ala Asn Ala Gin Glu Ala Val Ala Gin Ser Lys He 
305 310 315 320 

Val Ser Glu Asn Ala Gin Asn Gin Asn Asn Leu Asp Thr Gly Lys Pro 

325 330 335 

Phe Asn Pro Tyr Thr Asp Ala Ser Phe Ala Gin Ser Met Leu Lys Asn 

340 345 350 

Ala Gin Ala Gin Ala Glu He Leu Asn Gin Ala Glu Gin Val Val Lys 

355 360 365 

Asn Phe Glu Lys He Pro Thr Ala Phe Val Ser Asp Ser Leu Gly Val 

370 375 380 

Cys Tyr Glu Val Gin Gly Gly Glu Arg Arg Gly Thr Asn Pro Gly Gin 
385 390 395 400 

Val Thr Ser Asn Thr Trp Gly Ala Gly Cys Ala Tyr Val Lys Gin Thr 

405 410 415 

lie Thr Asn Leu Asp Asn Ser He Ala His Phe Gly Thr Gin Glu Gin 

420 425 430 

Gin He Gin Gin Ala Glu Asn lie Ala Asp Thr Leu Val Asn Phe Lys 

435 440 445 

Ser Arg Tyr Ser Glu Leu Gly Asn Thr Tyr Asn Ser He Thr Thr Ala 

450 455 460 

Leu Ser Lys Val Pro Asn Ala Gin Ser Leu Gin Asn Val Val Ser Lys 
465 470 475 480 ' 

Lys Asn Asn Pro Tyr Ser Pro Gin Gly lie Glu Thr Asn Tyr Tyr Leu 

485 490 495 

Asn Gin Asn Ser Tyr Asn Gin lie Gin Thr He Asn Gin Glu Leu Gly 

500 505 510 

Arg Asn Pro Phe Arg Lys Val Gly He Val Asn Ser Gin Thr Asn Asn 

515 520 525 

Gly Ala Met Asn Gly lie Gly He Gin Val Gly Tyr Lys Gin Phe Phe 

530 535 540 

Gly Gin Lys Arg Lys Trp Gly Ala Arg Tyr Tyr Gly Phe Phe Asp Tyr 
545 550 555 560 

Asn His Ala Phe He Lys Ser Ser Phe Phe Asn Ser Ala Ser Asp Val 

565 570 575 

Trp Thr Tyr Gly Phe Gly Ala Asp Ala Leu Tyr Asn Phe He Asn Asp 

580 585 590 

Lys Ala Thr Asn Phe Leu Gly Lys Asn Asn Lys Leu Ser Val Gly Leu 

595 600 605 

Phe Gly Gly lie Ala Leu Ala Gly Thr Ser Trp Leu Asn Ser Glu Tyr 

610 615 620 

Val Asn Leu Ala Thr Val Asn Asn Val Tyr Asn Ala Lys Met Asn Val 
625 630 635 640 

Ala Asn Phe Gin Phe Leu Phe Asn Met Gly Val Arg Met Asn Leu Ala 

645 650 . 655 

Arg Ser Lys Lys Lys Gly Ser Asp His Ala Ala Gin His Gly lie Glu 

660 665 670 

Leu Gly Leu Lys He Pro Thr lie Asn Thr Asn Tyr Tyr Ser Phe Met 



675 680 685 

Gly Ala Glu Leu Lys Tyr Arg Arg Leu Tyr Ser Val Tyr Leu Asn Tyr 

690 695 700 

Val Phe Ala Tyr 
705 



<210> 2 

<211> 741 

<212> PRT 

<213> Helicobacter pylori 



<400> 2 
Met Lys Lys His 
1 

Thr Leu Ser Ala 
20 

lie Gly Glu Ala 
35 

Leu Ser Asp Arg 
50 

Thr Leu Asn Thr 
65 

Ala Ala Arg Glu 

Lys Ala Asn Ser 
100 

Ala Val Gly Phe 
115 

Asn Ala Asn Gly 
130 

Glu Pro Gly Tyr 
145 

Thr Pro Gly Tyr 

Asn Glu Ala Tyr 
180 

Ala Leu Lys Glu 
195 

Cys Ser Gly Gly 
210 

Ser Asn Gin Asn 
225 

Lys Ser Val Thr 

Ser Gly Asn Thr 
260 

Glu Gly Val Pro 
275 

Leu lie Ser Thr 
290 

Gin Ser Gly Gly 
305 

Phe Thr Glu Glu 

Glu Leu Val Asn 
340 

Leu Val Gly Gly 



lie Leu Ser Leu 
5 

Glu Asp Asp Gly 

Ala Gin Met Val 
40 

Tyr Glu Ser Leu 
55 

Leu lie Lys Leu 
70 

Asn Leu Gly Ala 
85 

Pro Ala Tyr Gin 

Trp Asn Val Leu 
120 

Gin Lys Ser Thr 
135 

Arg Ser Thr Ser 
150 

Tyr Gly Pro Met 
165 

Gin lie Leu Gin 

Asn Asn Lys Lys 
200 

Gly Asn Asn Asn 
215 

Gly Gly Thr Lys 
230 

Thr Thr lie Ser 
245 

Ser Arg Val Ser 

Asp Ser Ala Gin 
280 

lie Asn Thr Ala 
295 

Pro Gin Met Glu 
310 

lie Ser Ala lie 
325 

Gin Thr Ser Val 
Asn Asn Gly Lys 



Ala Leu Gly Ser 
10 

Phe Tyr Thr Ser 
25 

Thr Asn Thr Lys 

Asn Asn Leu Leu 
60 

Ser Ala Asp Pro 
75 

Ser Ala Lys Asn 
90 

Ala Val Leu Leu 
105 

Gly Tyr Ala Thr 

Ser Ser Thr Thr 
140 

lie Thr Cys Ser 
155 

Ser lie Glu Asn 
170 

Thr Ala Leu Lys 
185 

Val Asn Val Thr 

Cys Ser Ser Glu 
220 

Thr Thr Thr Gin 
235 

Ser Lys Val Val 
250 

Tyr Thr Glu lie 
265 

Ala Leu Leu Ala 

Cys Pro Phe Phe 
300 

Pro Thr Lys Gly 
315 

Gin Lys Met lie 
330 

lie Asn Ser His 
345 

Pro Phe Asn Pro 



Leu Leu Val Ser 
15 

Val Gly Tyr Gin 
30 

Gly lie Gin Asp 
45 

Thr Arg Tyr Ser 

Ser Ala lie Asn 
80 

Leu lie Gly Asp 
95 

Ala He Asn Ala 
110 

Gin Cys Gly Gly 
125 

He Phe Asn Asn 

Leu Asn Gly Tyr 
160 

Phe Lys Lys Leu 
175 

Gin Gly Leu Pro 
190 

Tyr Thr Tyr Thr 
205 

Ala Thr Gly Val 

Thr He Asp Gly 
240 

Asp Ser Thr Ala 
255 

Thr Asn Lys Leu 
270 

Gin Ala Ser Thr 
285 

Ser Val Thr Asn 

Lys Leu Cys Gly 
320 

Thr Asp Ala Gin 
335 

Glu Gin Ser Thr 
350 

Phe Thr Asp Ala 







355 










360 










365 








Gin 


Phe 


Ala 


Gin Gly 


Met 


Leu 


Ala 


Asn 


Ala 


Ser 


Ala 


Gin 


Ala 


Lys 


Met 




370 










375 










380 










Leu 


Asn 


Leu 


Ala 


His 


Gin 


Val 


Gly Gin 


Thr 


He 


Asn 


Pro 


Asn 


Asn 


Leu 


385 










390 










395 










400 


Thr 


Gly Asn 


Phe 


Lys 


Asn 


Phe 


Val 


Thr 


Gly 


Phe 


Leu 


Ala 


Thr 


Cys 


Asn 










405 










410 










415 




Asn 


Pro 


Ser 


Thr 


Ala 


Gly 


Thr 


Gly Gly Thr 


Gin 


Gly Ser 


Ala 


Pro 


Gly 








420 










425 










430 






Thr 


Val 


Thr 


Thr 


Gin 


Thr 


Phe 


Ala 


Ser Gly Cys 


Ala 


Tyr 


Val 


Glu 


Gin 






435 










440 










445 








Thr 


He 
450 


Thr 


Asn 


Leu 


Glu 


Asn 
455 


Ser 


He 


Ala 


His 


Phe 
460 


Gly 


Thr 


Gin 


Glu 


Gin 


Gin 


He 


Gin 


Arg 


Ala 


Glu 


Asn 


lie 


Ala 


Asp 


Thr 


Leu 


Val 


Asn 


Phe 


465 










470 










475 










480 


Lys 


Ser 


Arg 


Tyr 


Ser 
4 85 


Glu 


Leu 


Gly 


Asn 


Thr 
490 


Tyr 


Asn 


Ser 


He 


Thr 
495 


Thr 


Ala 


Leu 


Ser 


Lys 
500 


Val 


Pro 


Asn 


Ala 


Gin 
505 


Ser 


Leu 


Gin 


Asn 


Val 
510 


Val 


Ser 


Lys 


Lys 


Asn 
515 


Asn 


Pro 


Tyr 


Ser 


Pro 
520 


Gin 


Gly 


He 


Glu 


Thr 
525 


Asn 


Tyr 


Tyr 


Leu 


Asn 
530 


Gin 


Asn 


Ser 


Tyr 


Asn 
535 


Gin 


He 


Gin 


Thr 


He 
540 


Asn 


Gin 


Glu 


Leu 


Gly Arg Asn 


Pro 


Phe 


Arg 


Lys 


Val 


Gly 


He 


Val 


Gly 


Ser 


Gin 


Thr 


Asn 


545 










550 










555 










560 


Asn 


Gly Ala 


Met 


Asn 


Gly 


lie 


Gly 


lie 


Gin 


Val 


Gly 


Tyr 


Glu 


Gin 


Phe 










565 










570 










575 




Phe 


Gly 


Gin 


Lys 


Arg 


Lys 


Trp 


Gly Ala Arg Tyr 


Tyr 


Gly 


Phe 


Phe 


Asp 








580 










585 










590 






Tyr 


Asn 


His 
595 


Ala 


Phe 


He 


Lys 


Ser 
600 


Ser 


Phe 


Phe 


Asn 


Ser 
605 


Ala 


Ser 


Asp 


Val 


Trp 


Thr 


Tyr Gly 


Phe 


Gly Ala Asp Ala 


Leu 


Tyr 


Asn 


Phe 


He 


Asn 




610 










615 










620 










Asp 


Lys 


Ala 


Thr 


Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Asn 


Lys 


Leu 


Ser 


Val 


Gly 


625 










630 










635 










640 


Leu 


Phe 


Gly 


Gly 


He 


Ala 


Leu 


Ala 


Gly Thr 


Ser 


Trp 


Leu 


Asn 


Ser 


Glu 










645 










650 










655 




Tyr 


Val 


Asn 


Leu 
660 


Ala 


Thr 


Val 


Asn 


Asn 
665 


Val 


Tyr 


Asn 


Ala 


Lys 
670 


Met 


Asn 


Val 


Ala 


Asn 
675 


Phe 


Gin 


Phe 


Leu 


Phe 
680 


Asn 


Met 


Gly 


Val 


Arg 
685 


Met 


Asn 


Leu 


Ala 


Arg 
690 


Pro 


Lys 


Lys 


Asn 


Asp 
695 


Ser 


Asp 


His 


Ala 


Ala 
700 


Gin 


His 


Gly 


lie 


Glu 


Leu 


Gly 


Leu 


Lys 


He 


Pro 


Thr 


lie 


Asn 


Thr 


Asn 


Tyr 


Tyr 


Ser 


Phe 


705 










710 










715 










720 


Met 


Gly Ala 


Glu 


Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 


Val 


Tyr 


Leu 


Asn 










725 










730 










735 




Tyr 


Val 


Phe 


Ala 


Tyr 

























740 



<210> 3 
<211> 745 
<212> PRT 

<213> Helicobacter pylori 
<400> 3 

Met Lys Lys His He Leu Ser Leu Ala Leu Gly Ser Leu Leu Val Ser 



1 

Thr Leu Ser Ala 
20 

lie Gly Glu Ala 
35 

Leu Ser Asp Asn 
50 

Thr Leu Asn Thr 
65 

Ala Val Arg Glu 

Lys Ala Asn Ser 
100 

Ala Val Gly Leu 
115 

Gly Asn Gly Thr 
130 

Pro Gly Gin Asp 
145 

Gly Pro Gly Lys 

Ala Tyr Gin lie 
180 

Glu Leu Gly Gly 
195 

Cys Arg Gin Thr 
210 

Ala Lys Asn Gly 
225 

Thr Gin Thr Thr 

Thr Tyr Asn Asn 
260 

Ala Glu Gin Leu 
275 

Thr Gin Cys Pro 
290 

Gly Gly Gin Pro 
305 

Phe Gin Gin Glu 

Glu He He Ala 
340 

Asn Asn Leu Asp 
355 

Phe Ala Gin Ser 
370 

Asn Leu Ser Glu 
385 

Asn Asn Val Asn 

Thr Asn Phe Val 
420 

Leu Pro Asn Ala 
435 

Tyr Val Gly Glu 
450 



5 

Glu Asp Asp Gly 

Ala Gin Met Val 
40 

Tyr Glu Asn Leu 
55 

Leu He Lys Leu 
70 

Asn Leu Gly Ala 
85 

Pro Ala Tyr Gin 

Trp Asn Thr He 
120 

Glu Ser Gly Pro 
135 

Ser Thr Gin He 
150 

Ser Met Ser He 
165 

He Gin Gin Ala 

Asn Gly Thr Lys 
200 

Ala Asp He Asn 
215 

Ser Ser Ser Ser 
230 

Ala Thr Thr Thr 
245 

Asn Lys Ala Thr 

Leu Asn Gin Ala 
280 

Leu Val Arg Ser 
295 

Trp Gly Leu Ser 
310 

Phe Ser Gin Val 
325 

Gin Ser Lys He 

Thr Gly Lys Pro 
360 

Met Leu Lys Asn 
375 

Gin Val Lys Lys 
390 

Glu Lys Leu Ala 
405 

Ser Ala Phe Leu 

Gly Val Thr Ser 
440 

Thr He Ser Ala 
455 



10 

Phe Tyr Thr Ser 
25 

Thr Asn Thr Lys 

Asn Asn Leu Leu 
60 

Ser Ala Asp Pro 
75 

Ser Thr Lys Asn 
90 

Ala Val Phe Leu 
105 

Gly Tyr Ala Val 

Gly Ser Val He 
140 

Thr Cys Asn Arg 
155 

Asp Glu Phe Lys 
170 

Leu Lys Asn Gin 
185 

Val Ser Val Asn 

Gly Gly Val Tyr 
220 

Ser Asn Gly Gly 
235 

Gin Asp Gly Val 
250 

Val Lys Phe Asp 
265 

Ala Asn He Met 

Thr Asn Asn Glu 
300 

Thr Ser Gly Asn 
315 

Thr Ser Met He 
330 

Val Ser Glu Asn 
345 

Phe Asn Pro Tyr 

Ala Gin Ala Gin 
380 

Asn Leu Glu Val 
395 

Gly Phe Gly Lys 
410 

Ala Ser Cys Lys 
425 

Asn Thr Trp Gly 

Leu Thr Asn Ser 
460 



15 

Val Gly Tyr Gin 
30 

Gly He Gin Gin 
45 

Thr Arg Tyr Ser 

Ser Ala He Asn 
80 

Leu He Gly Asp 
95 

Ala He Asn Ala 
110 

Met Cys Gly Asn 
125 

Phe Asn Asp Gin 

Phe Glu Ser Thr 
160 

Lys Leu Asn Glu 
175 

Ser Gly Phe Pro 
190 

Tyr Asn Tyr Glu 
205 

Gin Phe Cys Lys 

Asn Gly Ser Ser 
240 

Thr He Thr Thr 
255 

He Thr Asn Asn 
270 

Gin Val Leu Asn 
285 

Asn Thr Pro Gly 

Ala Cys Ser He 
320 

Lys Asn Ala Gin 
335 

Ala Gin Asn Gin 
350 

Thr Asp Ala Ser 
365 

Ala Glu Met Phe 

Met Lys Asn Asn 
400 

Glu Glu Val Met 
415 

Asp Gly Gly Thr 
430 

Ala Gly Cys Ala 
445 

He Ala His Phe 





Thr 


Gin 


Glu 


Gin 


Gin 


He 


Gin 


Gin 


Ala 


Glu 


Asn 


He 


Ala 


Asp 


Thr 


4 65 










470 










475 










480 


Leu 


Val 


Asn 


Phe 


Lys 


Ser 


Arg 


Tvr 


Ser 


Glu 


Leu Gly 


Asn 


Thr 


Tyr 


Asn 










485 










490 










495 




Ser 


lie 




Thr 


Ala 


Leu 


Ser 


Lys 


Val 


Pro 


Asn 


Ala 


Gin 


Ser 


Leu 


Gin 








500 










505 










510 






Asn 


Val 


Val 


Ser 


Lys 


Lys 


Asn 


Asn 


Pro 


Tyr 


Ser 


Pro 


Gin 


Gly 


He 


Glu 






515 










520 










525 








Thr 


Asn 


1 I s - 


xy j. 


Leu 


Asn 


Gin 


Asn 


Ser 


Tyr 


Asn 


Gin 


He 


Gin 


Thr 


He 




53 0 








535 










540 










As ri 


Gin 


Glu 


Leu 


Glv 


Arg 


Asn 


Pro 


Phe 


Arg 


Lys 


Val 


Gly 


He 


Val 


Asn 


Jt J 










550 










555 










560 


Del 


Gin 


Thr 


Asn 


Asn 


Gly 


Ala 


Met 


Asn 


Gly 


He 


Gly 


He 


Gin 


Val 


Gly 










565 










570 










575 




Tyr 


Lys 


Gin 


Phe 


Phe 


Gly Gin 


Lys 


Arg 


Lys 


Trp 


Gly Ala 


Arg 


Tyr 


Tyr 




580 










585 










590 






o-Ly 




±r lie 




Tvt~ 


Asn 


His 


Ala 


Phe 


He 


Lys 


Ser 


Ser 


Phe 


Phe 


Asn 




5 95 










600 










605 










Ala 


Ser 


Asp 


Val 


Trp 


Thr 


Tvr 


Gly 


Phe 


Gly Ala 


Asp 


Ala 


Leu 


Tyr 




610 










615 










620 










As n 


Phe 


lie 


Asn 


Asp 


Lys 


Ala 


Thr 


Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Asn 


Lys 


62 5 








630 










635 










640 


Leu 


Ser 


Leu 


Glv 


Leu 


Phe 


Gly 


Gly 


He 


Ala 


Leu 


Ala 


Gly 


Thr 


Ser 


Trp 








645 










650 










655 




Leu 


Asn 


Ser 


Glu 


Tvr 


Val 


Asn 


Leu 


Ala 


Thr 


Val 


Asn 


Asn 


Val 


Tyr 


Asn 








660 










665 










670 






Ala 


Lys 


Met 


Asn 


Val 


Ala 


Asn 


Phe 


Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly 


Val 




675 










680 










685 








Arg 


Met 


Asn 


Leu 


Ala 


Arg 


Ser 


Lys 


Lys 


Lys 


Gly 


Ser 


Asp 


His 


Ala 


Ala 


690 










695 










700 










Gin 


His 


Gly 


He 


Glu 


Leu 


Gly 


Leu 


Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 


705 








710 










715 










720 


Tyr 


Tyr 


Ser 


Phe 


Met 


Gly 


Ala 


Glu 


Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 






725 










730 










735 




Val 


Tyr 


Leu 


Asn 


Tyr 


Val 


Phe 


Ala 


Tyr 

















740 745 



<210> 4 

<211> 2127 

<212> DNA 

<213> Helicobacter pylori 



<400> 
atgaaaaaaa 
gaegaegget 
aacaccaaag 
aattacagca 
gcaagggata 
gcgtatcaag 
tacgetttta 
tttaataatg 
ggacatggtg 
atccaaaagg 
accactacaa 
acacctgaaa 
atagtaacac 
caagegagea 



cccttttact 
tttacacaag 
gcattcaaga 
ccctaaacac 
atctaggctc 
ccgtgctttt 
ctgcttgtgg 
tgecaggaca 
ggectatate 
ctttgacagc 
aacttgattt 
agttcccatg 
caacagaaac 
tcattatcac 



ctctctctct 
cgtgggctat 
gctttcagac 
ccttatcaaa 
aagctctagg 
agcactcaat 
tcctggcagt 
agatacgacg 
cactgeaaat 
caatggagct 
cactatcaat 
gagtgatggg 
aaatatcaac 
taccctaaat 



ctctctctct 
caaateggtg 
aattatgaaa 
ttgtccgctg 
aatttgcttg 
gctgcagtgg 
aacgagaatg 
accatcactt 
tatgegaaaa 
aatggagatg 
ggagacaaaa 
aaatatattc 
acagaaaata 
gaggcatgee 



cgtttttgct 
aagccgctca 
agctgaacaa 
atccgagcgc 
atgtcaaaac 
ggttgtggca 
cgaatggagg 
geaattegta 
tcaatcaagc 
gggtccccgt 
gaacgggggg 
acacccaatg 
aegctcaaga 
caaacttcca 



ccacgctgaa 
aatggtgaaa 
tcttttgaat 
gattaacgac 
caattccccc 
agttacaagc 
gatccaaact 
ttatgageca 
ctatcaaatc 
tttaagcaac 
caaaccaaat 
gattaacaca 
gcttttaaaa 
aaatggtggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



agaagttatt 
atcagcgcga 
gttagtgaaa 
acggacgcca 
aaccaagccg 
tctttagggg 
gtaacttcta 
gacaacagca 
gctgacactc 
atcaccaccg 
aagaataacc 
tacaaccaaa 
atcgtcaatt 
aagcaattct 
aaccatgcgt 
tttggagcgg 
aacaacaagc 
aattctgagt 
gcgaatttcc 
aaaggcagcg 
aacacgaact 
tatttgaatt 



ggcaagggat 
tccaaggcat 
acgcgcaaaa 
gctttgcgca 
aacaagtagt 
tgtgttatga 
acacttgggg 
tcgctcactt 
tagtgaattt 
cgctctccaa 
cctatagccc 
tccaaaccat 
ctcaaaccaa 
ttggccaaaa 
tcattaaatc 
acgctcttta 
tttccgtggg 
atgtgaattt 
aattcttatt 
atcatgcggc 
attattcttt 
atgtgttcgc 



aagcggcaat 
gatcgctaac 
tcaaaacaac 
aagcatgctc 
aaaaaacttt 
agtgcaaggg 
agccggttgc 
tggcactcaa 
caaatctaga 
agtccctaac 
tcaaggcata 
caaccaagaa 
caatggtgcc 
aagaaaatgg 
cagcttcttc 
taacttcatc 
gctttttgga 
agccaccgtg 
caatatggga 
tcagcatggg 
catgggggct 
ttactaa 



gggacaatgt 
gctcaagaag 
ttggatactg 
aaaaacgctc 
gaaaaaatcc 
ggtgagcgta 
gcgtatgtga 
gagcagcaga 
tacagcgaat 
gcgcaaagct 
gagaccaatt 
ctagggcgta 
atgaatggga 
ggcgctaggt 
aactcggctt 
aacgataaag 
gggattgcgt 
aataacgtct 
gtgaggatga 
attgaactag 
gaactcaaat 



gcgggatgtt 
ctgtcgcgca 
gaaaaccatt 
aagcgcaagc 
ctacagcctt 
ggggcaccaa 
aacaaaccat 
tacagcaagc 
taggcaacac 
tgcaaaacgt 
actacctcaa 
acccctttag 
tcggtattca 
attacggctt 
ctgatgtgtg 
ccaccaattt 
tagcgggcac 
ataacgctaa 
atttagccag 
ggcttaaaat 
acagaaggct 



taagaatgaa 
aagcaaaatc 
caacccttac 
agagatttta 
tgtatcagac 
tccaggtcag 
aacgaattta 
cgaaaacatc 
ctataacagc 
ggtgagcaaa 
tcaaaattct 
gaaagtgggc 
ggtgggctat 
ttttgactac 
gacttatggt 
cttaggcaaa 
ttcatggctt 
aatgaatgtg 
atccaagaaa 
ccccaccatc 
ttatagcgtg 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2127 



<210> 5 
<211> 2226 
<212> DNA 

<213> Helicobacter pylori 



<400 
atgaaaaaac 
gaagacgacg 
acaaacacca 
acccgataca 
gcggcgcgtg 
ccagcctatc 
ggctatgcta 
atcttcaaca 
acgcctggat 
cagatcctcc 
gtcaatgtta 
gccacaggtg 
aaaagcgtaa 
tcacgtgtct 
gcgctcttag 
agtgtaacta 
tttacagaag 
caaacgagcg 
cctttcaacc 
caagctaaaa 
actgggaatt 
gctggcactg 
tccggttgcg 
ggcactcaag 
aaatctagat 
gtccctaacg 
caaggcatag 
aaccaagaat 



> 5 
acatcctttc 
gcttttacac 
aaggcattca 
gcaccctaaa 
aaaatctggg 
aagcggtgct 
cgcaatgcgg 
acgagccagg 
actatggccc 
aaacggcgtt 
cctacactta 
taagcaatca 
ccaccacgat 
cctacaccga 
cgcaagcgag 
atcaaagtgg 
aaatcagcgc 
tcattaatag 
ctttcacgga 
tgctcaattt 
ttaaaaattt 
gtggcacaca 
cgtatgtgga 
agcagcaaat 
acagcgaatt 
cgcaaagctt 
aaaccaatta 
tagggcgtaa 



attagcttta 
aagcgtgggc 
agatctttca 
caccctgatc 
cgcgagcgcg 
tttagcgatc 
gggtaacgcc 
gtatcgatcc 
tatgagtatt 
aaaacaaggc 
cacatgctca 
aaatggcgga 
cagttcaaaa 
aatcaccaac 
cacgctcatt 
tggtccacag 
gatccaaaag 
ccatgaacaa 
cgctcaattc 
agcccatcaa 
tgttacaggc 
aggttcagct 
acaaaccata 
acaacgagcc 
ggggaatact 
gcaaaacgtg 
ctacttgaat 
cccttttagg 



ggctcgcttt 
tatcaaatcg 
gatcgttatg 
aaattgtccg 
aagaatttga 
aacgcggcgg 
aatggtcaaa 
acttccatca 
gagaatttca 
ttacccgcgc 
gggggaggga 
actaaaacca 
gtcgttgata 
aaattagaag 
agcaccatca 
atggaaccga 
atgatcacag 
tcaaccctag 
gctcaaggca 
gtggggcaaa 
tttttagcca 
ccaggcacgg 
acgaatttag 
gaaaatatcg 
tacaacagca 
gtgagcaaaa 
caaaattctt 
aaagtgggca 



tagtttccac 
gtgaagccgc 
aaagtttgaa 
ctgatccgag 
tcggcgataa 
tagggttttg 
aaagcacctc 
cttgctcttt 
aaaagcttaa 
tcaaagaaaa 
ataataactg 
ccacccaaac 
gcacagcgag 
gtgtgcctga 
acacggcatg 
ctaaagggaa 
acgcgcaaga 
tgggcggtaa 
tgctcgctaa 
ccattaaccc 
catgcaacaa 
ttaccactca 
aaaacagcat 
ctgacactct 
tcaccactgc 
agaataaccc 
acaaccaaat 
tcgtcggctc 



tttgagcgct 
tcaaatggta 
caaccttttg 
cgcgattaat 
agccaattcc 
gaatgtctta 
ttcaacgacc 
gaacgggtat 
cgaagcctat 
caacaagaag 
ctcgtcagaa 
catagacggc 
tggtaacaca 
tagcgctcaa 
cccgtttttt 
gttgtgcggt 
gctggtcaat 
taatggcaag 
cgctagcgcg 
taacaatctt 
cccctcaaca 
aactttcgct 
cgcgcacttt 
agtgaatttc 
gctctccaaa 
ctatagcccg 
ccaaaccatc 
tcaaaccaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



!.Jj Mi 



aacggcgcca 
agaaaatggg 
agcttcttca 
aacttcatca 
ctttttggcg 
gccaccgtga 
aacatgggag 
cagcatggga 
atgggggctg 
tactag 



tgaatgggat 
gcgctaggta 
actcggcttc 
acgataaagc 
ggattgcgtt 
ataatgtcta 
tgaggatgaa 
ttgagttagg 
aactcaaata 



cggtattcag 
ttacggcttt 
tgatgtgtgg 
cactaacttt 
agcgggcact 
taacgctaaa 
tttggccagg 
gcttaaaatc 
cagaaggctt 



gtgggctacg 
tttgattaca 
acttatggtt 
ttaggcaaaa 
tcatggctta 
atgaacgtgg 
cccaagaaaa 
cccaccatca 
tatagcgtgt 



agcaattctt 
accatgcgtt 
ttggagcgga 
acaacaagct 
attctgagta 
cgaacttcca 
acgacagcga 
acacgaacta 
atttgaatta 



tggccaaaaa 
tattaaatcc 
cgctctctat 
ttctgtgggg 
tgtgaattta 
attcttattc 
tcatgcggct 
ctattccttt 
tgtgttcgct 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2226 



<210> 6 

<211> 2238 

<212> DNA 

<213> Helicobacter pylori 



<400 
atgaaaaaac 
gaagacgacg 
acaaacacca 
acgagataca 
gcggtgcggg 
ccggcgtatc 
ggctatgcgg 
tttaatgacc 
gggcctggta 
atccagcaag 
gtgagtgtta 
cagttctgca 
acgcaaacaa 
aacaaagcca 
gcaaacatca 
aacactccag 
ttccaacaag 
caaagcaaaa 
ttcaaccctt 
gcagagatgt 
aataatgtta 
agcgcctttt 
aacacttggg 
atcgctcact 
ctagtgaatt 
gcgctctcca 
ccctatagcc 
atccaaacca 
tctcaaacca 
tttggccaaa 
ttcatcaaat 
gacgcgcttt 
ctttctttgg 
tacgtgaatt 
caattcttat 
gatcatgcag 
tattattcct 
tatgtgttcg 



> 6 
acatcctttc 
gcttttacac 
aaggcatcca 
gcaccctaaa 
aaaatctggg 
aagccgtgtt 
tcatgtgcgg 
aaccaggaca 
aaagcatgtc 
ctttaaaaaa 
attacaatta 
aggctaaaaa 
ccgcgacaac 
ccgtcaaatt 
tgcaagtcct 
ggggtggtca 
aatttagcca 
tcgttagtga 
acacggacgc 
tcaatttgag 
acgagaaatt 
tggcaagctg 
gggcgggttg 

ttggcactca 
tcaaatctag 
aagtccctaa 
ctcaaggcat 
tcaaccaaga 
acaatggtgc 
aaagaaaatg 
ccagcttttt 
ataacttcat 
ggctttttgg 
tagccaccgt 
tcaatatggg 
ctcagcatgg 
ttatgggggc 
cttactaa 



attagcttta 
aagcgtaggc 
acagctttca 
cacccttatc 
cgcgagcacg 
tttagcgatc 
gaacgggaac 
ggattccacg 
tattgatgaa 
tcaaagtggg 
cgaatgcaga 
tggtagtagt 
cacgcaagac 
tgacatcacc 
taatacgcaa 
accatggggt 
ggttactagc 
aaacgcgcaa 
cagctttgcg 
cgaacaagtg 
agcaggattt 
caaagatggt 
cgcgtatgtg 
agagcagcag 
atacagcgaa 
cgcgcaaagc 
agagaccaat 
actagggcgt 
catgaatggg 
gggcgctagg 
caactcggct 
caacgataaa 
cgggattgcg 
gaataacgtc 
agtgaggatg 
gattgagtta 
tgaactcaaa 



ggctcgcttt 
tatcagatcg 
gacaattatg 
aaattgtccg 
aagaatttga 
aacgcggcgg 
ggcacagaga 
caaattactt 
ttcaaaaaac 
tttcctgaat 
caaactgctg 
agcagtagta 
ggcgtaacga 
aataacgctg 
tgccctttag 
ttaagcacat 
atgatcaaaa 
aatcaaaaca 
caaagcatgc 
aaaaagaact 
gggaaagaag 
ggcacattgc 
ggagagacga 
atacagcaag 
ttaggcaaca 
ttgcaaaacg 
tactacctca 
aaccccttta 
atcggcattc 
tattacggct 
tctgacgtgt 
gccaccaatt 
ttagcgggca 
tataacgcta 
aatttagcca 
gggcttaaaa 
tacagaaggc 



tagtttccac 
gtgaagccgc 
aaaatttgaa 
ctgatccgag 
tcggcgataa 
tagggttgtg 
gtgggcctgg 
gcaaccgctt 
tcaatgaagc 
taggcgggaa 
atatcaacgg 
atggcggtaa 
tcaccactac 
aacagctgtt 
tgcgttccac 
ccgggaatgc 
acgcccaaga 
acttggatac 
tcaaaaacgc 
tggaagtcat 
aagtaatgac 
ctaatgcagg 
taagcgccct 
ccgaaaacat 
cctataacag 
tggtgagcaa 
atcaaaattc 
ggaaagtggg 
aggtgggcta 
tttttgatta 
ggacttatgg 
tcttaggcaa 
cttcatggct 
aaatgaatgt 
gatccaagaa 
tccccaccat 
tctatagcgt 



tttgagcgct 
tcaaatggta 
caacctttta 
cgcaattaat 
agccaactcc 
gaataccatc 
cagcgtgatc 
tgaatcaact 
ctatcaaatc 
cggcacaaaa 
cggtgtgtat 
tggcagtagc 
ctataataat 
aaatcaagcg 
gaataacgaa 
gtgcagcatc 
aataatcgcg 
tggaaaacca 
tcaagcgcaa 
gaaaaacaac 
caattttgtt 
ggttacttct 
aaccaacagc 
cgctgacact 
catcaccacc 
aaagaataac 
ttacaaccaa 
catcgtcaat 
taagcaattc 
caaccatgcg 
ttttggagcg 
aaacaacaag 
caattctgag 
ggcgaatttc 
aaaaggcagc 
caacacgaac 
gtatttgaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2238 



<210> 7 



O -b'i iH: 4LJ i^fe Jj 



<211> 20 
<212> PRT 

<213> Helicobacter pylori 
<400> 7 

Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin He Gly Glu Ala 

15 10 15 

Ala Gin Met Val 
20 

<210> 8 
<211> 33 
<212> DNA 

<213> "Artificial sequence" 
<400> 8 

ggccatatga aaaaacacat cctttcatta get 33 

<210> 9 
<211> 42 
<212> DNA 

<213> "Artificial sequence" 
<400> 9 

ggcaagcttg ggagtttcac aaaaagctta gtaagegaac ac 42 

<210> 10 
<211> 20 
<212> DNA 

<213> "Artificial sequence" 
<400> 10 

cgaaatcttg tgataagatc 20 

<210> 11 
<211> 33 
<212> DNA 

<213> "Artificial sequence" 
<400> 11 

ccgggcttgg taceggaaga egaeggcttt tac 33 

<210> 12 
<211> 44 
<212> DNA 

<213> "Artificial sequence" 
<400> 12 

ccgggcacta gtttagtaag cgaacacata attcaaatac aege 44 

<210> 13 
<211> 21 
<212> DNA 

<213> "Artificial sequence" 



<400> 13 
eggggggcaa accaaataca c 



21 



<211> 21 
<212> DNA 

<213> "Artificial sequence" 



<400> 14 
gtgtatttgg tttgcccccc g 

<210> 15 
<211> 36 
<212> DNA 

<213> "Artificial sequence" 
<400> 15 

gccgaacgtc tcccctaaat gaggcatgcc caaact 

<210> 16 
<211> 40 
<212> DNA 

<213> "Artificial sequence" 



<400> 16 

ctacggcgtc tcttagggta gtgataatga tgctcgcttg 

<210> 17 
<211> 30 
<212> DNA 

<213> "Artificial sequence" 



<400> 17 

cgcggcgata tggaagacga cggcttttac 

<210> 18 
<211> 30 
<212> DNA 

<213> "Artificial sequence" 
<400> 18 

ccggccggta ccttagtaag cgaacacata 

<210> 19 
<211> 2069 
<212> DNA 

<213> Helicobacter pylori 



<400> 19 
gaagacgacg gcttttacac 
aaaaacacca aaggcatcca 
aataattaca gcaccctaaa 
gacgcaaggg ataatctagg 
ccggcgtatc aagcggtgct 
agctatgctt ttactgcttg 
acttttaata atgtgccagg 
ccaggacatg gcgggcctat 
atcattcaaa aggctttgac 
aacaccacta caaaacttga 



aagcgtaggc tatcaaatcg 
agagctttca gacaattatg 
caccctgatc aaattatccg 
ctcaagctct aggaatttgc 
tttagcactg aatgctgcag 
tggtcctggc agtaacgaga 
acaaaatacg acgaccatca 
atccactgca aactatgcga 
agccaatgga gctaatggag 
tttcactatc aatggagaca 



gtgaagccgc tcaaatggtg 
aaaagctgaa caatcttttg 
ctgatccgag cgcaattaac 
ttgatgtcaa aaccaattcc 
tggggttgtg gcaagttaca 
acgcgaatgg aggtatccaa 
cttgcaattc gtattatgag 
aaatcaatca agcctatcaa 
atggggtccc tgttttaagc 
aaagaacggg gggcaaacca 



aataaaccat 
acacaagcaa 
aagcaagcga 
ggtagcggtt 
gaaatcagtg 
atcgttagtg 
acacagacgc 
taaaccaagc 
actctttagg 
aggtaacttc 
taaaaaacag 
tcgctgacac 
gcatcaccac 
aaaagaataa 
cttacaacca 
gcatcgtcgg 
acaagcaatt 
acaaccatgc 
gttttggagc 
aaaacaacaa 
ttaattctga 
tggcgaactt 
aaaaagacag 
tcaacaccaa 
tgtatttgaa 



taatatacca 
caccaacaac 
gcatcattat 
attggcaagg 
ctatccaagg 
aaaacgcgca 
cagcttcgct 
cgaacaagtg 
ggtgtgttat 
taacacttgg 
catcgctcac 
tctagtgaat 
tgcgctctcc 
cccctatagc 
aatccaaacc 
ctctcaaacc 
ctttggccaa 
gtttattaaa 
ggacgctctc 
gctttctgtg 
gtatgtgaat 
ccaattctta 
cgatcatgcg 
ctactattcc 
ttatgtgttc 



atggagtcat 
agaaaacatc 
cactacccta 
gataagtggc 
catgatcgct 
aaatcaaaac 
caaagcatgc 
gtgaaaaact 
gaagtgcaag 
ggggccggtt 
tttggcactc 
ttcaaatcta 
aaagtcccta 
ccgcaaggca 
atcaaccaag 
aacaacggcg 
aaaagaaaat 
tccagcttct 
tataacttca 
gggctttttg 
ttagccaccg 
ttcaacatgg 
gctcagcatg 
tttatggggg 
gcttactaa 



gggaaagcta 
aacacagaaa 
aatgaggcat 
aatgggacaa 
aacgcgcaag 
aacttggaca 
tcaaaaacgc 
ttgaaaaaat 

ggggtgagcg 

gcgcctatgt 
aagagcagca 
gatacagcga 
acgcgcaaag 
tagaaaccaa 
aattagggcg 
ccatgaatgg 
ggggcgctag 
tcaactcggc 
tcaacgataa 
gcgggattgc 
tgaataatgt 
gagtgaggat 
ggattgagtt 
ctgaactcaa 



tttcaacctc 
ataacgctca 
gcccaaactt 
tgtgtgggat 
aagctgtcgc 
ctggaaaacc 
tcaagcccaa 
ccctagaaat 
taggggcacc 
ggggcaaacc 
aatacaacga 
attggggaat 
cttgcaaaac 
ttactacttg 
taaccctttt 
gatcggtatt 
gtattacggc 
ttctgatgtg 
agccactaac 
gttagcgggc 
ctataacgct 
gaatttggcc 
agggcttaaa 
atacagaagg 



gtggaatgac 
agagctttta 
tcagaatggt 
gtttaagaat 
gcaaagtaaa 
attcacccct 
gcagagattt 
tttgtatcag 
aatccaggtc 
ataacaaatc 
gccgaaaata 
acttacaaca 
gtggtgagca 
aatcaaaatt 
aggaaagtgg 
caggtgggct 
ttttttgatt 
tggacttatg 
tttttaggca 
acttcatggc 
aaaatgaacg 
aggcccaaga 
atccccacca 
ctttatagcg 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2069 



<210> 20 

<211> 2156 

<212> DNA 

<213> Helicobacter pylori 



<400> 20 

gaagaccacg gcttttacac aagcgtaggc tatcaaatcg gtaagccgct caaatggtaa 60 

caaacaccaa aggcatccaa gagctttcag acaattatga aaagctgaac aatcttttga 120 

atagttacag caccctaaac acccttatca aattgtccgc tgatccgagc gcggtcagtg 180 

gcgcgatcaa caatttgaac gcgggcgcga cgggtttgct caaagaaaaa accaactccc 240 

ccgcctatca agccgtctct ttggcgttaa acgccgcagt gggcttgtgg aataccatcg 300 

gctatgcggt catgtgcggg aacgggaacg gcacatcgaa tgggcctggc agcgtgatct 3 60 

ttaatggaga gccaggacag ggctccacgc agattacttg caaccgcttt gaatcaactg 420 

gtcctggtca cagcatgtct attgatgaat tcaagaaact caatcaagcc tatcaaatca 480 

tccaacaagc tttaaaagga aatgggtttc ctgaactagg caacaacggc acaagcgtga 54 0 

gtgttgaata taaatatgaa tgcaaacaga ataataatga tatcgacggc ggtgtgagac 600 

aattctgcca agcgaaaaat ggtagtagca gtggtagtaa tggcagtagt ggtaatagca 660 

agcaaacaac cacgcaagac ggcgtaacga tcaccactac taatagtagt aacgaagcca 720 

ccgtcaattt tagcatcacc aataacgctg aacagctgtt aaatcaagcg gcaaacatca 780 

tgcaagtcct taacacgcaa tgccctttgg tgcgatctac gaatgatgaa aacgctccag 840 

ggggtggcca accatggggt ttaagcacga cagggaatgc gtgccaaatc ttccaacaag 900 

aatttagcca ggttactagc atgatcaaaa acgcccaaga aatcgtcgcg caaagccaaa 960 

tcgctaacgc taaccaaaaa gcagaaataa ctaaccccag taacttcaac cctttcacgg 1020 

acgctggctt tgcacaaaac atgctcaaaa acgctagagc gcaagcagag atgttcaatt 1080 

tagctgagca agtgaaaaag aatttggaag tcatgcaaaa caacaataac gttaatgcgg 1140 

aattatcagg atttggggat ggaatgacca attttgttag cgcctttttg gcaagctgca 12 00 

aaaatgatgg gacattgcct aatcaagggg ttacttctaa cacttggggg gccggttgcg 1260 

cgtatgtggg agagacgata acggctttaa ataacagcat cgctcacttt ggcactcaag 1320 

ccgagcagat agaacaagcc gaaaacatcg ctgacactct agtgcatttc aaatctagat 13 80 

acaatgaatt gggcaacact tataacagca tcaccaccgc gctctccaaa gtccctaacg 1440 

cgcaaagctt gcaaaatgtg atgggaaaaa agaataaccc ctatagccca caaggcatag 1500 



aaaccaatta 
taggccgtaa 
tgaatgggat 
gcgctaggta 
actcggcttc 
acgataaagc 
ggattgcgtt 
ataacgtcta 
tgaggatgaa 
ttgagttagg 
aactcaaata 



ctatctcaat 
cccctttagg 
cggtattcag 
ttacggcttt 
tgacgtatgg 
caccaatttc 
agcgggcact 
taacgctaaa 
tttagccaga 
gcttaaaatc 
cagaaggctc 



caaaactctt 
aaagtgggta 
gtgggctata 
tttgattaca 
acttatggtt 
ttaggcaaaa 
tcatggctca 
atggatgtgg 
cccaagaaaa 
cccaccatca 
tatagcgtgt 



acaaccaaat 
ttgtcagctc 
agcaattctt 
accatgcgtt 
ttggagtgga 
acaacaagct 
attctgaata 
cgaacttcca 
aaggcagcga 
acacgaacta 
atttgaatta 



ccaaaccatc 
tcaaaccaac 
tggccaaaaa 
cattaaatcc 
cgctctttat 
ttctgtaggg 
cgtgaattta 
gttcttattc 
tcatgcggct 
ctattccttt 
tgtgttcgct 



aaccaagaat 
aatggcgcca 
agaaaatggg 
agcttcttca 
aatttcatca 
ctttttggcg 
gccaccatga 
aacatgggag 
cagcatggga 
atgggggctg 
tactaa 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2156 



<210> 21 

<211> 689 

<212> PRT 

<213> Helicobacter pylori 



<400> 21 
Glu Asp Asp Gly 
1 

Ala Gin Met Val 
20 

Tyr Glu Lys Leu 
35 

Leu lie Lys Leu 
50 

Asn Leu Gly Ser 
65 

Pro Ala Tyr Gin 

Trp Gin Val Thr 
100 

Glu Asn Ala Asn 
115 

Asn Thr Thr Thr 
130 

Gly Pro lie Ser 
145 

lie lie Gin Lys 

Pro Val Leu Ser 
180 

Asp Lys Arg Thr 
195 

Ser His Gly Lys 
210 

Pro Thr Thr Glu 

225 

Lys Gin Ala Ser 

Phe Gin Asn Gly 
260 

Thr Met Cys Gly 
275 

lie Ala Asn Ala 
290 

Asn Ala Gin Asn 



Phe Tyr Thr Ser 
5 

Lys Asn Thr Lys 

Asn Asn Leu Leu 
40 

Ser Ala Asp Pro 
55 

Ser Ser Arg Asn 
70 

Ala Val Leu Leu 
85 

Ser Tyr Ala Phe 

Gly Gly He Gin 
120 

He Thr Cys Asn 
135 

Thr Ala Asn Tyr 
150 

Ala Leu Thr Ala 
165 

Asn Thr Thr Thr 

Gly Gly Lys Pro 
200 

Ala He Ser Thr 
215 

Asn He Asn Thr 
230 

He lie lie Thr 
245 

Gly Ser Gly Tyr 

Met Phe Lys Asn 
280 

Gin Glu Ala Val 
295 

Gin Asn Asn Leu 



Val Gly Tyr Gin 
10 

Gly lie Gin Glu 
25 

Asn Asn Tyr Ser 

Ser Ala lie Asn 
60 

Leu Leu Asp Val 
75 

Ala Leu Asn Ala 
90 

Thr Ala Cys Gly 
105 

Thr Phe Asn Asn 

Ser Tyr Tyr Glu 
140 

Ala Lys lie Asn 
155 

Asn Gly Ala Asn 
170 

Lys Leu Asp Phe 
185 

Asn Lys Pro Leu 

Ser Trp Asn Asp 
220 

Glu Asn Asn Ala 
235 

Thr Leu Asn Glu 
250 

Trp Gin Gly lie 
265 

Glu He Ser Ala 

Ala Gin Ser Lys 
300 

Asp Thr Gly Lys 



lie 


Gly Glu Ala 






15 




Leu 


Ser 


Asp 


Asn 




30 






Thr 


Leu 


Asn 


Thr 


45 








Asp 


Ala 


Arg 


Asp 


Lys 


Thr 


Asn 


Ser 








80 


Ala 


Val 


Gly 


Leu 






95 




Pro 


Gly 


Ser 


Asn 




110 






Val 


Pro 


Gly Gin 


125 








Pro 


Gly His 


Gly 


Gin 


Ala 


Tyr 


Gin 








160 


Gly 


Asp 


Gly 


Val 






175 




Thr 


He 


Asn 


Gly 




190 






He 


Tyr 


Gin 


Trp 


205 








Thr 


Gin 


Ala 


Thr 


Gin 


Glu 


Leu 


Leu 








240 


Ala 


Cys 


Pro 


Asn 






255 




Ser 


Gly 


Asn 


Gly 




270 






He 


Gin 


Gly Met 


285 








He 


Val 


Ser 


Glu 


Pro 


Phe 


Asn 


Pro 



3 05 










310 






Tyr 


Thr 


Asp 


Ala 


Ser 


Phe 


Ala 


Gin 










325 








Gin 


Ala 


Glu 


He 


Leu 


Asn 


Gin 


Ala 








340 










Lys 


He 


Pro 


Arg 


Asn 


Phe 


Val 


Ser 






355 










360 


Val 


Gin 


Gly 


Gly Glu 


Arg 


Arg 


Gly 




370 










375 




Asn 


Thr 


Asn 


Gly Ala 


Gly 


Cys 


Ala 


385 










3 90 






Leu 


Lys 


Asn 


Ser 


He 


Ala 


His 


Phe 










405 








Arg 


Ala 


Glu 


Asn 


He 


Ala 


Asp 


Thr 








420 










Ser 


Glu 


Leu 


Gly 


Asn 


Thr 


Tyr 


Asn 






435 










44 0 


Val 


Pro 


Asn 


Ala 


Gin 


Ser 


Leu 


Gin 




450 










455 




Pro 


Tyr 


Ser 


Pro 


Gin 


Gly 


lie 


Glu 


465 










470 






Ser 


Tyr 


Asn 


Gin 


He 


Gin 


Thr 


He 










485 








Phe 


Arg 


Lys 


Val 


Gly 


He 


Val 


Gly 








500 










Asn 


Gly 


He 


Gly 


He 


Gin 


Val 


Gly 






515 










520 


Arg 


Lys 


Trp 


Gly Ala 


Arg 


Tyr 


Tyr 




530 










535 




Phe 


He 


Lys 


Ser 


Ser 


Phe 


Phe 


Asn 


545 










550 






Gly 


Phe 


Gly 


Ala 


Asp 


Ala 


Leu 


Tyr 










565 








Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Asn 


Lys 








580 










He 


Ala 


Leu 


Ala 


Gly 


Thr 


Ser 


Trp 






595 










600 


Ala 


Thr 


Val 


Asn 


Asn 


Val 


Tyr 


Asn 




610 










615 




Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly Val 


625 










630 






Lys 


Lys 


Asp 


Ser 


Asp 


His 


Ala 


Ala 










645 








Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 








660 










Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 






675 










680 


Tyr 


















<210> 


22 












<211> 


713 












<212> 


PRT 












<213> 


Helicobacter pyl 


.ori 




<400> 


22 










Glu 


Asp 


Asp 


Gly 


Phe 


Tyr 


Thr 


Ser 







315 










3 2 0 


Ser 


Met 


Leu 


Lys 


Asn 


Ala 


Gin 


Ala 




3 3 0 










3 3 5 




Glu 


Gin 


Val 


Val 


Lys 


Asn 


Phe 


Glu 


345 










3 5 0 






Asp 


Ser 


Leu 


Gly Val 


Cys 


Tyr 


Glu 










365 








Thr 


Asn 


Pro 


Gly 


Gin 


Val 


Thr 


Ser 








380 










Tyr 


Val 


Gly Gin 


Thr 


lie 


Thr 


Asn 






T Q C 










4 0 0 


Gly 


Thr 


Gin 


Glu 


Gin 


Gin 


He 


Gin 




410 










415 




Leu 


Val 


Asn 


Phe 


Lys 


Ser 


Arg 


Tyr 


425 










430 






Ser 


He 


Thr 


Thr 


Ala 


Leu 


Ser 


Lys 










445 








Asn 


Val 


Val 


Ser 


Lys 


Lys 


Asn 


Asn 








460 










Thr 


Asn 


Tyr 


Tyr 


Leu 


Asn 


Gin 


Asn 






4 / r> 










4 8 0 


Asn 


Gin 


Glu 


Leu 


Gly Arg 


Asn 


Pro 




490 










4 95 




Ser 


Gin 


Thr 


Asn 


Asn 


Gly 


Ala 


Met 


5 05 










510 






Tyr 


Lys 


pi n 

Lrin 


Phe 


Phe 


Gly 


Gin 


Lys 










525 








Gly 


Phe 


Phe 


Asp 


Tyr 


Asn 


His 


Ala 








540 










Ser 


Ala 


Ser 


Asp 


Val 


Trp 


Thr 


Tyr 






555 










560 


Asn 


Phe 


He 


Asn 


Asp 


Lys 


Ala 


Thr 




570 










575 




Leu 


Ser 


Val 


Gly 


Leu 


Phe 


Gly Gly 


5 8 5 










590 






Leu 


Asn 


Ser 


Glu 


Tyr 


Val 


Asn 


Leu 










605 








Ala 


Lys 


Met 


Asn 


Val 


Ala 


Asn 


Phe 








620 










Arg 


Met 


Asn 


Leu 


Ala 


Arg 


Pro 


Lys 






635 










640 


Gin 


His 


Gly 


He 


Glu 


Leu 


Gly 


Leu 




650 










655 




Tyr 


Tyr 


Ser 


Phe 


Met 


Gly 


Ala 


Glu 


665 










670 






Val 


Tyr 


Leu 


Asn 


Tyr 


Val 


Phe 


Ala 



685 



Val Gly Tyr Gin He Gly Glu Ala 



15 10 15 

Ala Gin Met Val Thr Asn Thr Lys Gly lie Gin Glu Leu Ser Asp Asn 

20 25 30 

Tyr Glu Lys Leu Asn Asn Leu Leu Asn Ser Tyr Ser Thr Leu Asn Thr 

35 40 45 

Leu lie Lys Leu Ser Ala Asp Pro Ser Ala Val Ser Gly Ala lie Asn 

50 55 60 

Asn Leu Asn Ala Gly Ala Thr Gly Leu Leu Lys Glu Lys Thr Asn Ser 
65 ' 70 75 80 

Pro Ala Tyr Gin Ala Val Ser Leu Ala Leu Asn Ala Ala Val Gly Leu 

85 90 95 

Trp Asn Thr lie Gly Tyr Ala Val Met Cys Gly Asn Gly Asn Gly Thr 

100 105 110 

Ser Asn Gly Pro Gly Ser Val lie Phe Asn Gly Glu Pro Gly Gin Gly 

115 120 125 

Ser Thr Gin lie Thr Cys Asn Arg Phe Glu Ser Thr Gly Pro Gly His 

130 135 140 

Ser Met Ser lie Asp Glu Phe Lys Lys Leu Asn Gin Ala Tyr Gin lie 
145 150 155 160 

lie Gin Gin Ala Leu Lys Gly Asn Gly Phe Pro Glu Leu Gly Asn Asn 

165 170 175 

Gly Thr Ser Val Ser Val Glu Tyr Lys Tyr Glu Cys Lys Gin Asn Asn 

180 185 190 

Asn Asp lie Asp Gly Gly Val Arg Gin Phe Cys Gin Ala Lys Asn Gly 

195 200 205 

Ser Ser Ser Gly Ser Asn Gly Ser Ser Gly Asn Ser Lys Gin Thr Thr 

210 215 220 

Thr Gin Asp Gly Val Thr lie Thr Thr Thr Asn Ser Ser Asn Glu Ala 
225 230 235 240 

Thr Val Asn Phe Ser lie Thr Asn Asn Ala Glu Gin Leu Leu Asn Gin 

245 250 255 

Ala Ala Asn lie Met Gin Val Leu Asn Thr Gin Cys Pro Leu Val Arg 

260 265 270 

Ser Thr Asn Asp Glu Asn Ala Pro Gly Gly Gly Gin Pro Trp Gly Leu 

275 280 285 

Ser Thr Thr Gly Asn Ala Cys Gin He Phe Gin Gin Glu Phe Ser Gin 

290 295 300 

Val Thr Ser Met He Lys Asn Ala Gin Glu He Val Ala Ser Gin He 
305 310 315 320 

Ala Asn Ala Asn Gin Lys Ala Glu lie Thr Asn Pro Ser Asn Phe Asn 

325 330 335 

Pro Phe Thr Asp Ala Gly Phe Ala Gin Asn Met Leu Lys Asn Ala Arg 

340 345 350 

Ala Gin Ala Glu Met Phe Asn Leu Ala Glu Gin Val Lys Lys Asn Leu 

355 360 365 

Glu Val Met Gin Asn Asn Asn Asn Val Asn Ala Glu Leu Ser Gly Phe 

370 375 380 

Gly Asp Gly Met Thr Asn Phe Val Ser Ala Phe Leu Ala Ser Cys Lys 
385 390 395 400 

Asn Asp Gly Thr Leu Pro Asn Gin Gly Val Thr Ser Asn Thr Trp Gly 

405 410 415 

Ala Gly Cys Ala Tyr Val Gly Glu Thr He Thr Ala Leu Asn Asn Ser 

420 425 430 

He Ala His Phe Gly Thr Gin Ala Glu Gin He Glu Gin Ala Glu Asn 

435 440 445 

He Ala Asp Thr Leu Val His Phe Lys Ser Arg Tyr Asn Glu Leu Gly 
450 455 460 



Asn Thr Tyr Asn Ser lie Thr Thr Ala Leu Ser Lys Val Pro Asn Ala 
465 470 475 480 

Gin Ser Leu Asn Val Met Gly Lys Lys Asn Asn Pro Tyr Ser Pro Gly 

485 490 495 

He Glu Thr Asn Tyr Tyr Leu Asn Asn Ser Tyr Asn Gin He Gin Thr 

500 505 510 

He Asn Gin Glu Leu Gly Arg Asn Pro Phe Arg Lys Val Gly He Val 

515 520 525 

Ser Ser Gin Thr Asn Asn Gly Ala Met Asn Gly He Gly He Gin Val 

530 535 540 

Gly Tyr Lys Gin Phe Phe Gly Gin Lys Arg Lys Trp Gly Ala Arg Tyr 
545 550 555 560 

Tyr Gly Phe Phe Asp Tyr Asn His Ala Phe He Lys Ser Ser Phe Phe 

565 570 575 

Asn Ser Ala Ser Asp Val Trp Thr Tyr Gly Phe Gly Val Asp Ala Leu 

580 ' 585 590 

Tyr Asn Phe lie Asn Asp Lys Ala Thr Asn Phe Leu Gly Lys Asn Asn 

595 600 605 

Lys Leu Ser Val Gly Leu Phe Gly Gly He Ala Leu Ala Gly Thr Ser 

610 615 620 

Trp Leu Asn Ser Glu Tyr Val Asn Leu Ala Thr Met Asn Asn Val Tyr 
625 630 635 640 

Asn Ala Lys Met Asp Val Ala Asn Gin Phe Leu Phe Asn Met Gly Val 

645 650 655 

Arg Met Asn Leu Ala Arg Pro Lys Lys Lys Gly Ser Asp His Ala Ala 

660 665 670 

Gin His Gly He Glu Leu Gly Leu Lys He Pro Thr He Asn Thr Asn 

675 680 685 

Tyr Tyr Ser Phe Met Cys Ala Glu Leu Lys Tyr Arg Arg Leu Tyr Ser 

690 695 700 

Val Tyr Leu Asn Tyr Val Phe Ala Tyr 
705 710 

<210> 23 
<211> 20 
<212> PRT 

<213> Helicobacter pylori 
<400> 23 

Cys Ala Ser Gly Asn Thr Ser His Val He Thr Asn Lys Leu Asp Gly 

15 10 15 

Val Pro Asp Ser 
20 

<210> 24 
<211> 18 
<212> PRT 

<213> Helicobacter pylori 
<400> 24 

Cys Ser Pro Ser Val Asn Gly Thr Lys Thr Thr Thr Gin Thr He Asp 

15 10 15 

Gly Lys 



<210> 25 
<211> 12 



O ^ ./ il; 



<212> PRT 

<213> Helicobacter pylori 
<400> 25 

Cys Tyr Pro Glu Asn Leu Ser Glu Asn Phe Lys Asn 
15 10 

<210> 26 
<211> 12 
<212> PRT 

<213> Helicobacter pylori 
<400> 26 

Cys Asn Pro Glu Asn Leu Ser Glu Asn Phe Lys Asn 
15 10 

<210> 27 
<211> 8 
<212> PRT 

<213> Helicobacter pylori 
<400> 27 

Cys Ser Gly Gin Gly Asn Asn Asn 
1 5 

<210> 28 
<211> 22 
<212> PRT 

<213> Helicobacter pylori 
<400> 28 

Cys Val Met Lys Asn Asn Asn Asn Val Asn Glu Lys Leu Ala Gly Phe 

15 10 15 

Gly Lys Glu Glu Val Met 
20 

<210> 29 
<211> 18 
<212> PRT 

<213> Helicobacter pylori 
<400> 29 

Cys Lys Ala Lys Asn Gly Ser Ser Ser Ser Ser Asn Gly Gly Asn Gly 

15 10 15 

Ser Ser 



<210> 30 

<211> 16 

<212> PRT 

<213> Helicobacter pylori 

<400> 30 

Cys Leu Val Arg Ser Thr Asn Asn Glu Asn Thr Pro Gly Gly Gly Gin 
15 10 15 



<210> 31 



<211> 16 
<212> PRT 

<213> Helicobacter pylori 



<400> 31 

Cys Leu Val Arg Ser Thr Asn Asn Glu Asn Thr Pro Gly Gly Gly Gin 
15 10 15 

<210> 32 
<211> 14 
<212> PRT 

<213> Helicobacter pylori 
<400> 32 

Cys Arg Gin Thr Ala Asp lie Asn Gly Gly Val Tyr Gin Phe 
15 10 

<210> 33 
<211> 60 
<212> DNA 

<213> "Artificial Sequence" 
<400> 33 

gargaygayg gnttytayac nwsngtnggn taycarathg gngargcngc ncaratggtn 60 

<210> 34 
<211> 20 
<212> PRT 

<213> Helicobacter pylori 
<400> 34 

Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin He Gly Glu Ala 

1 5 10 15 

Ala Gin Met Val 
20 

<210> 35 
<211> 21 
<212> PRT 

<213> Helicobacter pylori 
<400> 35 

Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin He Gly Glu Ala 

15 10 15 

Ala Gin Met Val Lys 
20 

<210> 36 
<211> 15 
<212> PRT 

<213> Helicobacter pylori 
<400> 36 

Ser Thr Ser Ser Thr Thr He Phe Asn Asn Glu Pro Gly Tyr Arg 
1.5 10 15 



<210> 37 



<211> 14 
<212> PRT 

<213> Helicobacter pylori 
<400> 37 

Thr Gly Gly Lys Pro Asn Lys Pro Leu lie Tyr Gin Trp Ser 
15 10 

<210> 38 
<211> 9 
<212> PRT 

<213> Helicobacter pylori 
<400> 38 

Thr Thr Thr Gin Thr lie Asp Gly Lys 
1 5 

<210> 39 
<211> 13 
<212> PRT 

<213> Helicobacter pylori 
<400> 39 

Asn Ser lie Ala His Phe Gly Thr Gin Glu Gin Gin lie 
15 10 

<210> 40 
<211> 13 
<212> PRT 

<213> Helicobacter pylori 
<400> 40 

Val Pro Asn Ala Gin Ser Leu Gin Asn Val Val Ser Lys 
15 10 

<210> 41 
<211> 14 
<212> PRT 

<213> Helicobacter pylori 
<400> 41 

Ser Lys Lys Asn Asn Pro Tyr Ser Pro Gin Gly lie Glu Thr 
15 10 

<210> 42 
<211> 7 
<212> PRT 

<213> Helicobacter pylori 



<400> 42 
Asn Tyr Tyr Leu Asn Gin Asn 
1 5 



SEQUENCE LISTING 
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<160> 42 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 708 
<212> PRT 

<213> Helicobacter pylori 



<400> 1 



Met 


Lys 


Lys 


Thr 


Leu 


Leu 


Leu 


Ser 


Leu 


Ser 


Leu 


Ser 


Leu 


Ser 


Phe 


Leu 


1 




5 










10 










15 




Leu 


His 


Ala 


Glu 


Asp 


Asp 


Gly 


Phe 


Tyr 


Thr 


Ser 


Val 


Gly Tyr 


Gin 


He 








20 










25 










30 






Gly 


Glu 


Ala 


Ala 


Gin 


Met 


Val 


Lys 


Asn 


Thr 


Lys 


Gly 


He 


Gin 


Glu 


Leu 




35 










40 










45 








Ser 


Asp 


Asn 


Tyr 


Glu 


Lys 


Leu 


Asn 


Asn 


Leu 


Leu 


Asn 


Asn 


Tyr 


Ser 


Thr 




50 










55 










60 










Leu 


Asn 


Thr 


Leu 


He 


Lys 


Leu 


Ser 


Ala 


Asp 


Pro 


Ser 


Ala 


He 


Asn Asp 


65 










70 










75 










80 


Ala 


Arg 


Asp 


Asn 


Leu 


Gly 


Ser 


Ser 


Ser 


Arg 


Asn 


Leu 


Leu 


Asp 


Val 


Lys 






85 










90 










95 




Thr 


Asn 


Ser 


Pro 


Ala 


Tyr 


Gin 


Ala 


Val 


Leu 


Leu 


Ala 


Leu 


Asn 


Ala 


Ala 








100 










105 










110 






Val 


Gly 


Leu 


Trp 


Gin 


Val 


Thr 


Ser 


Tyr 


Ala 


Phe 


Thr 


Ala 


Cys 


Gly 


Pro 




115 










120 










125 








Gly 


Ser 


Asn 


Glu 


Asn 


Ala 


Asn 


Gly 


Gly 


He 


Gin 


Thr 


Phe 


Asn 


Asn 


Val 


130 










135 










140 










Pro 


Gly 


Gin 


Asp 


Thr 


Thr 


Thr 


He 


Thr 


Cys 


Asn 


Ser 


Tyr 


Tyr 


Glu 


Pro 


145 








150 










155 










160 


Gly 


His 


Gly 


Gly 


Pro 


He 


Ser 


Thr 


Ala 


Asn 


Tyr 


Ala 


Lys 


He 


Asn 


Gin 



165 170 175 



Ala 


Tyr 


Gin 


He 


He 


Gin 


Lys 


Ala 


Leu 


Thr 


Ala 


Asn 


Gly 


Ala 


Asn Gly 






180 










185 










190 






Asp 


Gly 


Val 


Pro 


Val 


Leu 


Ser 


Asn 


Thr 


Thr 


Thr 


Lys 


Leu 


Asp 


Phe 


Thr 


195 










200 










205 








He 


Asn 


Gly 


Asp 


Lys 


Arg 


Thr 


Gly 


Gly 


Lys 


Pro 


Asn 


Thr 


Pro 


Glu 


Lys 




210 








215 










220 










Phe 


Pro 


Trp 


Ser 


Asp 


Gly 


Lys 


Tyr 


He 


His 


Thr 


Gin 


Trp 


He 


Asn 


Thr 


225 








230 










235 










240, 


He 


Val 


Thr 


Pro 


Thr 
245 


Glu 


Thr 


Asn 


He 


Asn 
250 


Thr 


Glu 


Asn 


Asn 


Ala 
255 


Gin' 


Glu 


Leu 


Leu 


Lys 
260 


Gin 


Ala 


Ser 


He 


He 
265 


He 


Thr 


Thr 


Leu 


Asn 
270 


Glu 


Ala 


Cys 


Pro 


Asn 


Phe 


Gin 


Asn 


Gly 


Gly 


Arg 


Ser 


Tyr 


Trp 


Gin 


Gly 


He 


Ser 




275 










280 










285 








Gly 


Asn 


Gly 


Thr 


Met 


Cys 


Gly 


Met 


Phe 


Lys 


Asn 


Glu 


He 


Ser 


Ala 


He 


290 










295 










300 










Gin 


Gly 


Met 


He 


Ala 


Asn 


Ala 


Gin 


Glu 


Ala 


Val 


Ala 


Gin 


Ser 


Lys 


He 


305 








310 










315 










320 


Val 


Ser 


Glu 


Asn 


Ala 


Gin 


Asn 


Gin 


Asn 


Asn 


Leu Asp 


Thr 


Gly 


Lys 


Pro 










325 










330 










335 




Phe 


Asn 


Pro 


Tyr 
340 


Thr 


Asp 


Ala 


Ser 


Phe 
345 


Ala 


Gin 


Ser 


Met 


Leu 
350 


Lys 


Asn 


Ala 


Gin 


Ala 
355 


Gin 


Ala 


Glu 


He 


Leu 
360 


Asn 


Gin 


Ala 


Glu 


Gin 
365 


Val 


Val 


Lys 


Asn 


Phe 


Glu 


Lys 


He 


Pro 


Thr 


Ala 


Phe 


Val 


Ser 


Asp 


Ser 


Leu 


Gly Val 




370 








375 










380 








Gin 


Cys 


Tyr 


Glu 


Val 


Gin 


Gly 


Gly 


Glu 


Arg 


Arg 


Gly 


Thr 


Asn 


Pro 


Gly 


385 








390 










395 










400 


Val 


Thr 


Ser 


Asn 


Thr 
405 


Trp 


Gly 


Ala 


Gly 


Cys 
410 


Ala 


Tyr 


Val 


Lys 


Gin 
415 


Thr 


He 


Thr 


Asn 


Leu 
420 


Asp 


Asn 


Ser 


He 


Ala 
425 


His 


Phe 


Gly 


Thr 


Gin 
430 


Glu 


Gin 


Gin 


He 


Gin 
435 


Gin 


Ala 


Glu 


Asn 


He 
440 


Ala 


Asp 


Thr 


Leu 


Val 
445 


Asn 


Phe 


Lys 


Ser 


Arg 
450 


Tyr 


Ser 


Glu 


Leu 


Gly 
455 


Asn 


Thr 


Tyr 


Asn 


Ser 
460 


He 


Thr 


Thr 


Ala 


Leu 


Ser 


Lys 


Val 


Pro 


Asn 


Ala 


Gin 


Ser 


Leu 


Gin 


Asn 


Val 


Val 


Ser 


Lys 


465 








470 










475 










480 


Lys 


Asn 


Asn 


Pro 


Tyr 


Ser 


Pro 


Gin 


Gly 


lie 


Glu 


Thr 


Asn 


Tyr 


Tyr 


Leu 








485 










490 










495 




Asn 


Gin 


Asn 


Ser 


Tyr 


Asn 


Gin 


He 


Gin 


Thr 


He 


Asn 


Gin 


Glu 


Leu 


Gly 








500 








505 










510 






Arg 


Asn 


Pro 


Phe 


Arg 


Lys 


Val 


Gly 


He 


Val 


Asn 


Ser 


Gin 


Thr 


Asn 


Asn 




515 










520 










525 








Gly Ala 


Met 


Asn 


Gly 


He 


Gly 


He 


Gin 


Val 


Gly Tyr 


Lys 


Gin 


Phe 


Phe 




530 










535 










540 




Phe 






Gly Gin 


Lys 


Arg 


Lys 


Trp 


Gly 


Ala 


Arg 


Tyr 


Tyr Gly 


Phe 


Asp 


Tyr 


545 








550 










555 










560 


Asn 


His 


Ala 


Phe 


He 


Lys 


Ser 


Ser 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Asp 


Val 











565 










570 










575 




Trp 


Thr 


Tyr 


Gly 


Phe 


Gly 


Ala 


Asp 


Ala 


Leu 


Tyr 


Asn 


Phe 


He 


Asn 


Asp 






580 










585 










590 






Lys 


Ala 


Thr 


Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Asn 


Lys 


Leu 


Ser 


Val 


Gly 


Leu 




595 










600 










605 








Phe 


Gly 


Gly 


He 


Ala 


Leu 


Ala 


Gly 


Thr 


Ser 


Trp 


Leu 


Asn 


Ser 


Glu 


Tyr 




610 








615 










620 










Val 


Asn 


Leu 


Ala 


Thr 


Val 


Asn 


Asn 


Val 


Tyr 


Asn 


Ala 


Lys 


Met 


Asn 


Val 


625 










630 










635 










640 


Ala 


Asn 


Phe 


Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly 


Val 


Arg 


Met 


Asn 


Leu 


Ala 










645 










650 










655 




Arg 


Ser 


Lys 


Lys 


Lys 




Q o v~ 


i-ifap 


nio 


r\ _L CI 


Ala 


Gin 


His 


Glv 


He 


Glu 






660 










665 










670 






Leu 


Gly 


Leu 


Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 


Tyr 


Tyr 


Ser 


Phe 


Met 




675 










680 










685 








Gly 


Ala 


Glu 


Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 


Val 


Tyr 


Leu 


Asn 


Tyr 


690 










695 










7 n n 










Val 


Phe 


Ala 


Tyr 


























705 


































<210> 


2 




























<211> 


741 




























<212> 


PRT 




























<213> 


Helicobacter pylori 




















<400> 


2 


























Met 


Lys 


Lys 


His 


He 


Leu 


Ser 


Leu 


Ala 


Leu 


Gly 


Ser 


Leu 


Leu 


Val 


Ser 


1 




5 










10 










15 




Thr 


Leu 


Ser 


Ala 


Glu 


Asp 


Asp 


Gly 


Phe 


Tyr 


Thr 


Ser 


Val 


Gly 


Tyr 


Gin 








20 










25 










30 






He 


Gly 


Glu 


Ala 


Ala 


Gin 


Met 


Val 


Thr 


Asn 


Thr 


Lys 


Gly 


He 


Gin 


Asp 




35 










40 










45 








Leu 


Ser 


Asp 


Arg 


Tyr 


Glu 


Ser 


Leu 


Asn 


Asn 


Leu 


Leu 


Thr 


Arg 


Tyr 


Ser 




50 










55 










60 










Thr 


Leu 


Asn 


Thr 


Leu 


He 


Lys 


Leu 


Ser 


Ala 


Asp 


Pro 


Ser 


Ala 


He 


Asn 


65 










70 










75 










80 


Ala 


Ala 


Arg 


Glu 


Asn 


Leu 


Gly 


Ala 


Ser 


Ala 


Lys 


Asn 


Leu 


He 


Gly Asp 








85 










90 










95 




Lys 


Ala 


Asn 


Ser 


Pro 


Ala 


Tyr 


Gin 


Ala 


Val 


Leu 


Leu 


Ala 


He 


Asn 


Ala 






100 










105 










110 






Ala 


Val 


Gly 


Phe 


Trp 


Asn 


Val 


Leu 


Gly 


Tyr 


Ala 


Thr 


Gin 


Cys 


Gly Gly 






115 










120 










125 








Asn 


Ala 


Asn 


Gly 


Gin 


Lys 


Ser 


Thr 


Ser 


Ser 


Thr 


Thr 


He 


Phe 


Asn 


Asn 




130 










135 










140 










Glu 


Pro 


Gly 


Tyr 


Arg 


Ser 


Thr 


Ser 


He 


Thr 


Cys 


Ser 


Leu 


Asn 


Gly Tyr 


145 








150 










155 










160 


Thr 


Pro 


Gly 


Tyr 


Tyr 


Gly 


Pro 


Met 


Ser 


He 


Glu 


Asn 


Phe 


Lys 


Lys 


Leu 






165 










170 










175 




Asn 


Glu 


Ala 


Tyr 


Gin 


He 


Leu 


Gin 


Thr 


Ala 


Leu 


Lys 


Gin 


Gly 


Leu 


Pro 



U *9 7jS Obliy 



Ala Leu 

Cys Ser 
210 
Ser Asn 
225 

Lys Ser 

Ser Gly 

Glu Gly 

Leu lie 
290 
Gin Ser 
305 

Phe Thr 

Glu Leu 

Leu Val 

Gin Phe 
370 
Leu Asn 
385 

Thr Gly 

Asn Pro 

Thr Val 

Thr lie 
450 
Gin Gin 
465 

Lys Ser 

Ala Leu 

Lys Lys 

Leu Asn 
530 
Gly Arg 
545 

Asn Gly 
Phe Gly 



Lys Glu 
195 

Gly Gly 

Gin Asn 

Val Thr 

Asn Thr 
260 
Val Pro 
275 

Ser Thr 

Gly Gly 

Glu Glu 

Val Asn 
340 
Gly Gly 
355 

Ala Gin 

Leu Ala 

Asn Phe 

Ser Thr 
420 
Thr Thr 
435 

Thr Asn 

He Gin 

Arg Tyr 

Ser Lys 
500 
Asn Asn 
515 

Gin Asn 

Asn Pro 

Ala Met 

Gin Lys 
580 



Asn Asn 

Gly Asn 

Gly Gly 
230 
Thr Thr 
245 

Ser Arg 

Asp Ser 

He Asn 

Pro Gin 
310 
He Ser 
325 

Gin Thr 

Asn Asn 

Gly Met 

His Gin 
390 
Lys Asn 
405 

Ala Gly 

Gin Thr 

Leu Glu 

Arg Ala 
470 
Ser Glu 
485 

Val Pro 

Pro Tyr 

Ser Tyr 

Phe Arg 
550 
Asn Gly 
565 

Arg Lys 



Lys Lys 
200 
Asn Asn 
215 

Thr Lys 

He Ser 

Val Ser 

Ala Gin 
280 
Thr Ala 
295 

Met Glu 

Ala He 

Ser Val 

Gly Lys 
360 
Leu Ala 
375 

Val Gly 

Phe Val 

Thr Gly 

Phe Ala 
440 
Asn Ser 
455 

Glu Asn 

Leu Gly 

Asn Ala 

Ser Pro 
520 
Asn Gin 
535 

Lys Val 
He Gly 
Trp Gly 



Val Asn 

Cys Ser 

Thr Thr 

Ser Lys 
250 
Tyr Thr 
265 

Ala Leu 

Cys Pro 

Pro Thr 

Gin Lys 
330 
He Asn 
345 

Pro Phe 

Asn Ala 

Gin Thr 

Thr Gly 
410 
Gly Thr 
425 

Ser Gly 

He Ala 

He Ala 

Asn Thr 
490 
Gin Ser 
505 

Gin Gly 

He Gin 

Gly He 

He Gin 
570 
Ala Arg 
585 



Val Thr 

Ser Glu 
220 
Thr Gin 
235 

Val Val 

Glu lie 

Leu Ala 

Phe Phe 
300 
Lys Gly 
315 

Met He 

Ser His 

Asn Pro 

Ser Ala 
380 
He Asn 
395 

Phe Leu 

Gin Gly 

Cys Ala 

His Phe 
460 
Asp Thr 
475 

Tyr Asn 

Leu Gin 

He Glu 

Thr lie 
540 
Val Gly 
555 

Val Gly 
Tyr Tyr 



Tyr Thr Tyr Thr 
205 

Ala Thr Gly Val 

Thr He Asp Gly 
240 

Asp Ser Thr Ala 
255 

Thr Asn Lys Leu 
270 

Gin Ala Ser Thr 
285 

Ser Val Thr Asn 

Lys Leu Cys Gly 
320 

Thr Asp Ala Gin 
335 

Glu Gin Ser Thr 
350 

Phe Thr Asp Ala 
365 

Gin Ala Lys Met 

Pro Asn Asn Leu 
400 

Ala Thr Cys Asn 
415 

Ser Ala Pro Gly 
430 

Tyr Val Glu Gin 
445 

Gly Thr Gin Glu 

Leu Val Asn Phe 
480 

Ser He Thr Thr 
495 

Asn Val Val Ser 
510 

Thr Asn Tyr Tyr 

525 

Asn Gin Glu Leu 

Ser Gin Thr Asn 
560 

Tyr Glu Gin Phe 
575 

Gly Phe Phe Asp 
590 



Tyr Asn His 


Ala 


Phe 


He 


Lys 


Ser 


Ser 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Asp 






595 










600 










605 








Val 


Trr> 


Thr 


Tyr 


Gly 


Phe 


Gly 


Ala 


Asp 


Ala 


Leu 


Tyr 


Asn 


Phe 


lie 


Asn 




610 










615 










620 










Asp 


Lys 


Ala 


Thr 


Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Asn 


Lys 


Leu 


Ser 


Val 


Gly 


625 








630 










635 










640 


Leu 


Phe 


Gly 


Gly 


He 


Ala 


Leu 


Ala 


Gly 


Thr 


Ser 


Trp 


Leu 


Asn 


Ser 


Glu 








645 










650 










655 




Tvr 


Val 


Asn 


Leu 


Ala 


Thr 


Val 


Asn 


Asn 


Val 


Tyr 


Asn 


Ala 


Lys 


Met 


Asn 






660 










665 










670 






Val 


Ala 


Asn 


Phe 


Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly 


Val 


Arg 


Met 


Asn 


Leu 






675 










680 










685 








Ala 


Arg 


Pro 


Lys 


Lys 


Asn 


Asp 


Ser 


Asp 


His 


Ala 


Ala 


Gin 


His 


Gly 


lie 




690 










695 










700 










Glu 


Leu Gly 


Leu 


Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 


Tyr 


Tyr 


Ser 


Phe 


705 










710 










715 










720 


Met 


Gly Ala 


Glu 


Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 


Val 


Tyr 


Leu 


Asn 










725 










730 










735 




Tyr 


Val 


Phe 


Ala 


Tyr 

























740 



<210> 3 

<211> 745 

<212> PRT 

<213> Helicobacter pylori 



<400> 3 



Met 


Lys 


Lys 


His 


He 


Leu 


Ser 


Leu 


Ala 


Leu 


Gly 


Ser 


Leu 


Leu 


Val 


Ser 


1 




5 










10 










15 




Thr 


Leu 


Ser 


Ala 
20 


Glu 


Asp 


Asp 


Gly 


Phe 
25 


Tyr 


Thr 


Ser 


Val 


Gly 
30 


Tyr 


Gin 


lie 


Gly 


Glu 


Ala 


Ala 


Gin 


Met 


Val 


Thr 


Asn 


Thr 


Lys 


Gly 


He 


Gin 


Gin 




35 










40 










45 








Leu 


Ser 


Asp 


Asn 


Tyr 


Glu 


Asn 


Leu 


Asn 


Asn 


Leu 


Leu 


Thr 


Arg 


Tyr 


Ser 




50 








55 










60 










Thr 


Leu 


Asn 


Thr 


Leu 


He 


Lys 


Leu 


Ser 


Ala 


Asp 


Pro 


Ser 


Ala 


He 


Asn 


65 










70 










75 










80 


Ala 


Val 


Arg 


Glu 


Asn 


Leu 


Gly 


Ala 


Ser 


Thr 


Lys 


Asn 


Leu 


He 


Gly Asp 








85 










90 










95 




Lys 


Ala 


Asn 


Ser 


Pro 


Ala 


Tyr 


Gin 


Ala 


Val 


Phe 


Leu 


Ala 


He 


Asn 


Ala 






100 










105 










110 






Ala 


Val 


Gly 


Leu 


Trp 


Asn 


Thr 


He 


Gly 


Tyr 


Ala 


Val 


Met 


Cys 


Gly Asn 






115 










120 










125 








Gly Asn 


Gly 


Thr 


Glu 


Ser 


Gly 


Pro 


Gly 


Ser 


Val 


He 


Phe 


Asn 


Asp 


Gin 




130 








135 










140 










Pro 


Gly 


Gin 


Asp 


Ser 


Thr 


Gin 


He 


Thr 


Cys 


Asn 


Arg 


Phe 


Glu 


Ser 


Thr 


145 








150 










155 










160 


Gly 


Pro 


Gly 


Lys 


Ser 


Met 


Ser 


lie 


Asp 


Glu 


Phe 


Lys 


Lys 


Leu 


Asn 


Glu 






165 










170 










175 




Ala 


Tyr 


Gin 


He 


He 


Gin 


Gin 


Ala 


Leu 


Lys 


Asn 


Gin 


Ser 


Gly 


Phe 


Pro 



180 185 190 



Glu 


Leu 


Gly 


Gly 


Asn 


Gly 


Thr 


Lys 


Val 


Ser 


Val 


Asn 


Tyr Asn 


Tyr 


Glu 






195 










200 










205 








Cys 


Arg 


Gin 


Thr 


Ala 


Asp 


He 


Asn 


Gly 


Gly 


Val 


Tyr 


Gin 


Phe 


Cys 


Lys 


210 










215 










220 










Ala 


Lys 


Asn 


Gly 


Ser 


Ser 


Ser 


Ser 


Ser 


Asn 


Gly Gly 


Asn 


Gly 


Ser 


Ser 


225 






230 










235 










240 


Thr 


Gin 


Thr 


Thr 


Ala 


Thr 


Thr 


Thr 


Gin 


Asp 


Gly Val 


Thr 


He 


Thr 


Thr 










245 










250 










255 




Thr 


Tyr 


Asn 


Asn 


Asn 


Lys 


Ala 


Thr 


Val 


Lys 


Phe 


Asp 


He 


Thr 


Asn 


Asn 






260 










265 










270 






Ala 


Glu 


Gin 
275 


Leu 


Leu 


Asn 


Gin 


Ala 
280 


Ala 


Asn 


He 


Met 


Gin 
285 


Val 


Leu 


Asn 


Thr 


Gin 


Cys 


Pro 


Leu 


Val 


Arg 


Ser 


Thr 


Asn 


Asn 


Glu 


Asn 


Thr 


Pro 


Gly 




290 








295 










300 










Gly Gly 


Gin 


Pro 


Trp 


Gly 


Leu 


Ser 


Thr 


Ser 


Gly Asn 


Ala 


Cys 


Ser 


lie 


3 05 










310 










315 










320 


Phe 


Gin 


Gin 


Glu 


Phe 
325 


Ser 


Gin 


Val 


Thr 


Ser 
330 


Met 


He 


Lys 


Asn 


Ala 

335 


Gin 


Glu 


lie 


He 


Ala 
340 


Gin 


Ser 


Lys 


He 


Val 
345 


Ser 


Glu 


Asn 


Ala 


Gin 
350 


Asn 


Gin 


Asn 


Asn 


Leu 
355 


Asp 


Thr 


Gly 


Lys 


Pro 
360 


Phe 


Asn 


Pro 


Tyr 


Thr 
365 


Asp 


Ala 


Ser 


Phe 


Ala 
370 


Gin 


Ser 


Met 


Leu 


Lys 
375 


Asn 


Ala 


Gin 


Ala 


Gin 
380 


Ala 


Glu 


Met 


Phe 


Asn 


Leu 


Ser 


Glu 


Gin 


Val 


Lys 


Lys 


Asn 


Leu 


Glu 


Val 


Met 


Lys 


Asn 


Asn 


385 










390 










395 










400 


Asn 


Asn 


Val 


Asn 


Glu 


Lys 


Leu 


Ala 


Gly 


Phe 


Gly Lys 


Glu 


Glu 


Val 


Met 










405 










410 










415 




Thr 


Asn 


Phe 


Val 
420 


Ser 


Ala 


Phe 


Leu 


Ala 
425 


Ser 


Cys 


Lys 


Asp 


Gly 
430 


Gly 


Thr 


Leu 


Pro 


Asn 
435 


Ala 


Gly 


Val 


Thr 


Ser 
440 


Asn 


Thr 


Trp 


Gly 


Ala 
445 


Gly 


Cys 


Ala 


Tyr 


Val 


Gly 


Glu 


Thr 


He 


Ser 


Ala 


Leu 


Thr 


Asn 


Ser 


He 


Ala 


His 


Phe 


450 








455 










460 










Gly 


Thr 


Gin 


Glu 


Gin 


Gin 


He 


Gin 


Gin 


Ala 


Glu 


Asn 


He 


Ala 


Asp 


Thr 


465 










470 










475 










480 


Leu 


Val 


Asn 


Phe 


Lys 
485 


Ser 


Arg 


Tyr 


Ser 


Glu 
490 


Leu 


Gly 


Asn 


Thr 


Tyr 
495 


Asn 


Ser 


He 


Thr 


Thr 
500 


Ala 


Leu 


Ser 


Lys 


Val 
505 


Pro 


Asn 


Ala 


Gin 


Ser 
510 


Leu 


Gin 


Asn 


Val 


Val 
515 


Ser 


Lys 


Lys 


Asn 


Asn 
520 


Pro 


Tyr 


Ser 


Pro 


Gin 
525 


Gly 


lie 


LrlU 


Thr 


Asn 


Tyr 


Tyr 


Leu 


Asn 


Gin 


Asn 


Ser 


Tyr 


Asn 


Gin 


He 


Gin 


Thr 


He 




530 








535 










540 










Asn 


Gin 


Glu 


Leu 


Gly 


Arg 


Asn 


Pro 


Phe 


Arg 


Lys 


Val 


Gly 


He 


Val 


Asn 


545 










550 










555 










560 


Ser 


Gin 


Thr 


Asn 


Asn 


Gly 


Ala 


Met 


Asn 


Gly 


He 


Gly 


He 


Gin 


Val 


Gly 








565 










570 










575 




Tyr 


Lys 


Gin 


Phe 


Phe 


Gly 


Gin 


Lys 


Arg 


Lys 


Trp 


Gly 


Ala 


Arg 


Tyr 


Tyr 









580 










585 










590 






Gly 


Phe 


Phe 


Asp 


Tyr 


Asn 


His 


Ala 


Phe 


He 


Lys 


Ser 


Ser 


Phe 


Phe 


Asn 




595 










600 










605 








Ser 


Ala 


Ser 


Asp 


Val 


Trp 


Thr 


Tyr 


Gly 


Phe 


Gly Ala Asp 


Ala 


Leu 


Tyr 


610 








615 










620 










Asn 


Phe 


He 


Asn 


Asp 


Lys 


Ala 


Thr 


Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Asn 


Lys 


625 








630 










635 










640 


Leu 


Ser 


Leu 


Gly 


Leu 


Phe 


Gly 


Gly 


He 


Ala 


Leu 


Ala 


Gly 


Thr 


Ser 


Trp 








645 










650 










655 




Leu 


Asn 


Ser 


Glu 


Tyr 


Val 


Asn 


Leu 


Ala 


Thr 


Val 


Asn 


Asn 


Val 


Tyr 


Asn 








660 








665 










670 




Val 


Ala 


Lys 


Met 


Asn 


Val 


Ala 


Asn 


Phe 


Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly 




675 










680 










685 






Ala 


Arg 


Met 


Asn 


Leu 


Ala 


Arg 


Ser 


Lys 


Lys 


Lys 


Gly 


Ser 


Asp 


His 


Ala 


690 










695 










700 










Gin 


His 


Gly 


He 


Glu 


Leu 


Gly 


Leu 


Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 


705 








710 










715 








Tyr 


720 


Tyr 


Tyr 


Ser 


Phe 


Met 


Gly 


Ala 


Glu 


Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Ser 






725 










730 










735 




Val 


Tyr 


Leu 


Asn 


Tyr 


Val 


Phe 


Ala 


Tyr 




















740 










745 

















<210> 4 

<211> 2127 

<212> DNA 

<213> Helicobacter pylori 



<400> 
atgaaaaaaa 
gacgacggct 
aacaccaaag 
aattacagca 
gcaagggata 
gcgtatcaag 
tacgctttta 
tttaataatg 
ggacatggtg 
atccaaaagg 
accactacaa 
acacctgaaa 
atagtaacac 
caagcgagca 
agaagttatt 
atcagcgcga 
gttagtgaaa 
acggacgcca 
aaccaagccg 
tctttagggg 
gtaacttcta 
gacaacagca 



cccttttact 
tttacacaag 
gcattcaaga 
ccctaaacac 
atctaggctc 
ccgtgctttt 
ctgcttgtgg 
tgccaggaca 
ggcctatatc 
ctttgacagc 
aacttgattt 
agttcccatg 
caacagaaac 
tcattatcac 
ggcaagggat 
tccaaggcat 
acgcgcaaaa 
gctttgcgca 
aacaagtagt 
tgtgttatga 
acacttgggg 
tcgctcactt 



ctctctctct 
cgtgggctat 
gctttcagac 
ccttatcaaa 
aagctctagg 
agcactcaat 
tcctggcagt 
agatacgacg 
cactgcaaat 
caatggagct 
cactatcaat 
gagtgatggg 
aaatatcaac 
taccctaaat 
aagcggcaat 
gatcgctaac 
tcaaaacaac 
aagcatgctc 
aaaaaacttt 
agtgcaaggg 
agccggttgc 
tggcactcaa 



ctctctctct 
caaatcggtg 
aattatgaaa 
ttgtccgctg 
aatttgcttg 
gctgcagtgg 
aacgagaatg 
accatcactt 
tatgcgaaaa 
aatggagatg 
ggagacaaaa 
aaatatattc 
acagaaaata 
gaggcatgcc 
gggacaatgt 
gctcaagaag 
ttggatactg 
aaaaacgctc 
gaaaaaatcc 
ggtgagcgta 
gcgtatgtga 
gagcagcaga 



cgtttttgct 
aagccgctca 
agctgaacaa 
atccgagcgc 
atgtcaaaac 
ggttgtggca 
cgaatggagg 
gcaattcgta 
tcaatcaagc 
gggtccccgt 
gaacgggggg 
acacccaatg 
acgctcaaga 
caaacttcca 
gcgggatgtt 
ctgtcgcgca 
gaaaaccatt 
aagcgcaagc 
ctacagcctt 
ggggcaccaa 
aacaaaccat 
tacagcaagc 



ccacgctgaa 60 

aatggtgaaa 12 0 

tcttttgaat 180 

gattaacgac 24 0 

caattccccc 300 

agttacaagc 360 

gatccaaact 420 

ttatgagcca 480 

ctatcaaatc 540 

tttaagcaac 600 

caaaccaaat 660 

gattaacaca 720 

gcttttaaaa 780 

aaatggtggt 84 0 

taagaatgaa 900 

aagcaaaatc 960 

caacccttac 1020 

agagatttta 1080 

tgtatcagac 1140 

tccaggtcag 1200 

aacgaattta 1260 

cgaaaacatc 1320 



gctgacactc 
atcaccaccg 
aagaataacc 
tacaaccaaa 
atcgtcaatt 
aagcaattct 
aaccatgcgt 
tttggagcgg 
aacaacaagc 
aattctgagt 
gcgaatttcc 
aaaggcagcg 
aacacgaact 
tatttgaatt 



tagtgaattt 
cgctctccaa 
cctatagccc 
tccaaaccat 
ctcaaaccaa 
ttggccaaaa 
tcattaaatc 
acgctcttta 
tttccgtggg 
atgtgaattt 
aattcttatt 
atcatgcggc 
attattcttt 
atgtgttcgc 



caaatctaga 
agtccctaac 
tcaaggcata 
caaccaagaa 
caatggtgcc 
aagaaaatgg 
cagcttcttc 
taacttcatc 
gctttttgga 
agccaccgtg 
caatatggga 
tcagcatggg 
catgggggct 
ttactaa 



tacagcgaat 
gcgcaaagct 
gagaccaatt 
ctagggcgta 
atgaatggga 
ggcgctaggt 
aactcggctt 
aacgataaag 
gggattgcgt 
aataacgtct 
gtgaggatga 
attgaactag 
gaactcaaat 



taggcaacac 
tgcaaaacgt 
actacctcaa 
acccctttag 
tcggtattca 
attacggctt 
ctgatgtgtg 
ccaccaattt 
tagcgggcac 
ataacgctaa 
atttagccag 
ggcttaaaat 
acagaaggct 



ctataacagc 1380 

ggtgagcaaa 144 0 

tcaaaattct 1500 

gaaagtgggc 156 0 

ggtgggctat 162 0 

ttttgactac 1680 

gacttatggt 1740 

cttaggcaaa 1800 

ttcatggctt 1860 

aatgaatgtg 192 0 

atccaagaaa 1980 

ccccaccatc 2040 

ttatagcgtg 2100 
2127 



<210> 5 

<211> 2226 

<212> DNA 

<213> Helicobacter pylori 



<400 
atgaaaaaac 
gaagacgacg 
acaaacacca 
acccgataca 
gcggcgcgtg 
ccagcctatc 
ggctatgcta 
atcttcaaca 
acgcctggat 
cagatcctcc 
gtcaatgtta 
gccacaggtg 
aaaagcgtaa 
tcacgtgtct 
gcgctcttag 
agtgtaacta 
tttacagaag 
caaacgagcg 
cctttcaacc 
caagctaaaa 
actgggaatt 
gctggcactg 
tccggttgcg 
ggcactcaag 
aaatctagat 
gtccctaacg 
caaggcatag 
aaccaagaat 
aacggcgcca 



> 5 
acatcctttc 
gcttttacac 
aaggcattca 
gcaccctaaa 
aaaatctggg 
aagcggtgct 
cgcaatgcgg 
acgagccagg 
actatggccc 
aaacggcgtt 
cctacactta 
taagcaatca 
ccaccacgat 
cctacaccga 
cgcaagcgag 
atcaaagtgg 
aaatcagcgc 
tcattaatag 
ctttcacgga 
tgctcaattt 
ttaaaaattt 
gtggcacaca 
cgtatgtgga 
agcagcaaat 
acagcgaatt 
cgcaaagctt 
aaaccaatta 
tagggcgtaa 
tgaatgggat 



attagcttta 
aagcgtgggc 
agatctttca 
caccctgatc 
cgcgagcgcg 
tttagcgatc 
gggtaacgcc 
gtatcgatcc 
tatgagtatt 
aaaacaaggc 
cacatgctca 
aaatggcgga 
cagttcaaaa 
aatcaccaac 
cacgctcatt 
tggtccacag 
gatccaaaag 
ccatgaacaa 
cgctcaattc 
agcccatcaa 
tgttacaggc 
aggttcagct 
acaaaccata 
acaacgagcc 
ggggaatact 
gcaaaacgtg 
ctacttgaat 
cccttttagg 
cggtattcag 



ggctcgcttt 
tatcaaatcg 
gatcgttatg 
aaattgtccg 
aagaatttga 
aacgcggcgg 
aatggtcaaa 
acttccatca 
gagaatttca 
ttacccgcgc 
gggggaggga 
actaaaacca 
gtcgttgata 
aaattagaag 
agcaccatca 
atggaaccga 
atgatcacag 
tcaaccctag 
gctcaaggca 
gtggggcaaa 
tttttagcca 
ccaggcacgg 
acgaatttag 
gaaaatatcg 
tacaacagca 
gtgagcaaaa 
caaaattctt 
aaagtgggca 
gtgggctacg 



tagtttccac 
gtgaagccgc 
aaagtttgaa 
ctgatccgag 
tcggcgataa 
tagggttttg 
aaagcacctc 
cttgctcttt 
aaaagcttaa 
tcaaagaaaa 
ataataactg 
ccacccaaac 
gcacagcgag 
gtgtgcctga 
acacggcatg 
ctaaagggaa 
acgcgcaaga 
tgggcggtaa 
tgctcgctaa 
ccattaaccc 
catgcaacaa 
ttaccactca 
aaaacagcat 
ctgacactct 
tcaccactgc 
agaataaccc 
acaaccaaat 
tcgtcggctc 
agcaattctt 



tttgagcgct 60 

tcaaatggta 120 

caaccttttg 180 

cgcgattaat 240 

agccaattcc 300 

gaatgtctta 360 

ttcaacgacc 420 

gaacgggtat 4 80 

cgaagcctat 540 

caacaagaag 600 

ctcgtcagaa 660 

catagacggc 72 0 

tggtaacaca 780 

tagcgctcaa 840 

cccgtttttt 900 

gttgtgcggt 960 

gctggtcaat 102 0 

taatggcaag 1080 

cgctagcgcg 114 0 

taacaatctt 1200 

cccctcaaca 1260 

aactttcgct 1320 

cgcgcacttt 1380 

agtgaatttc 1440 

gctctccaaa 1500 

ctatagcccg 1560 

ccaaaccatc 162 0 

tcaaaccaac 1680 

tggccaaaaa 174 0 



2226 



60 



agaaaatggg gcgctaggta ttacggcttt tttgattaca accatgcgtt tattaaatcc 1800 

agcttcttca actcggcttc tgatgtgtgg acttatggtt ttggagcgga cgctctctat 1860 

aacttcatca acgataaagc cactaacttt ttaggcaaaa acaacaagct ttctgtgggg 1920 

ctttttggcg ggattgcgtt agcgggcact tcatggctta attctgagta tgtgaattta 1980 

gccaccgtga ataatgtcta taacgctaaa atgaacgtgg cgaacttcca attcttattc 2040 

aacatgggag tgaggatgaa tttggccagg cccaagaaaa acgacagcga tcatgcggct 2100 

cagcatggga ttgagttagg gcttaaaatc cccaccatca acacgaacta ctattccttt 2160 

atgggggctg aactcaaata cagaaggctt tatagcgtgt atttgaatta tgtgttcgct 2220 
tactag 

<210> 6 
<211> 2238 
<212> DNA 

<213> Helicobacter pylori 
<400> 6 

atgaaaaaac acatcctttc attagcttta ggctcgcttt tagtttccac tttgagcgct 

gaagacgacg gcttttacac aagcgtaggc tatcagatcg gtgaagccgc tcaaatggta 120 

acaaacacca aaggcatcca acagctttca gacaattatg aaaatttgaa caacctttta 180 

acgagataca gcaccctaaa cacccttatc aaattgtccg ctgatccgag cgcaattaat 240 

gcggtgcggg aaaatctggg cgcgagcacg aagaatttga tcggcgataa agccaactcc 3 00 

ccggcgtatc aagccgtgtt tttagcgatc aacgcggcgg tagggttgtg gaataccatc 3 60 

ggctatgcgg tcatgtgcgg gaacgggaac ggcacagaga gtgggcctgg cagcgtgatc 420 
tttaatgacc aaccaggaca ggattccacg caaattactt gcaaccgctt tgaatcaact 
gggcctggta aaagcatgtc tattgatgaa ttcaaaaaac tcaatgaagc ctatcaaatc 
atccagcaag ctttaaaaaa tcaaagtggg tttcctgaat taggcgggaa cggcacaaaa 
gtgagtgtta attacaatta cgaatgcaga caaactgctg atatcaacgg cggtgtgtat 

cagttctgca aggctaaaaa tggtagtagt agcagtagta atggcggtaa tggcagtagc 720 

acgcaaacaa ccgcgacaac cacgcaagac ggcgtaacga tcaccactac ctataataat 780 

aacaaagcca ccgtcaaatt tgacatcacc aataacgctg aacagctgtt aaatcaagcg 840 

gcaaacatca tgcaagtcct taatacgcaa tgccctttag tgcgttccac gaataacgaa 900 

aacactccag ggggtggtca accatggggt ttaagcacat ccgggaatgc gtgcagcatc 960 

ttccaacaag aatttagcca ggttactagc atgatcaaaa acgcccaaga aataatcgcg 1020 

caaagcaaaa tcgttagtga aaacgcgcaa aatcaaaaca acttggatac tggaaaacca 1080 

ttcaaccctt acacggacgc cagctttgcg caaagcatgc tcaaaaacgc tcaagcgcaa 1140 

gcagagatgt tcaatttgag cgaacaagtg aaaaagaact tggaagtcat gaaaaacaac 1200 

aataatgtta acgagaaatt agcaggattt gggaaagaag aagtaatgac caattttgtt 1260 

agcgcctttt tggcaagctg caaagatggt ggcacattgc ctaatgcagg ggttacttct 1320 

aacacttggg gggcgggttg cgcgtatgtg ggagagacga taagcgccct aaccaacagc 1380 

atcgctcact ttggcactca agagcagcag atacagcaag ccgaaaacat cgctgacact 1440 

ctagtgaatt tcaaatctag atacagcgaa ttaggcaaca cctataacag catcaccacc 1500 

gcgctctcca aagtccctaa cgcgcaaagc ttgcaaaacg tggtgagcaa aaagaataac 1560 

ccctatagcc ctcaaggcat agagaccaat tactacctca atcaaaattc ttacaaccaa 1620 

atccaaacca tcaaccaaga actagggcgt aaccccttta ggaaagtggg catcgtcaat 1680 

tctcaaacca acaatggtgc catgaatggg atcggcattc aggtgggcta taagcaattc 174 0 

tttggccaaa aaagaaaatg gggcgctagg tattacggct tttttgatta caaccatgcg 1800 

ttcatcaaat ccagcttttt caactcggct tctgacgtgt ggacttatgg ttttggagcg 1860 

gacgcgcttt ataacttcat caacgataaa gccaccaatt tcttaggcaa aaacaacaag 1920 

ctttctttgg ggctttttgg cgggattgcg ttagcgggca cttcatggct caattctgag 1980 
tacgtgaatt tagccaccgt gaataacgtc tataacgcta aaatgaatgt ggcgaatttc 



480 
540 
600 
660 



2040 



caattcttat tcaatatggg agtgaggatg aatttagcca gatccaagaa aaaaggcagc 
gatcatgcag ctcagcatgg gattgagtta gggcttaaaa tccccaccat caacacgaac 
tattattcct ttatgggggc tgaactcaaa tacagaaggc tctatagcgt gtatttgaac 
tatgtgttcg cttactaa 

<210> 7 
<211> 20 
<212> PRT 

<213> Helicobacter pylori 
<400> 7 

Glu Asp Asp Gly Phe Tyr Thr-Ser Val Gly Tyr Gin He Gly Glu Ala 

! 5 10 15 

Ala Gin Met Val 
20 

<210> 8 
<211> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward Primer 
<400> 8 

ggccatatga aaaaacacat cctttcatta get 

<210> 9 
<211> 42 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse Primer 
<400> 9 

ggcaagcttg ggagtttcac aaaaagctta gtaagegaac ac 

<210> 10 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for promoter region of 75 kDa gene 



<400> 10 
cgaaatcttg tgataagatc 



<211> 33 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer for 5' start mature 75, 77, and 7kDa genes 



<400> 11 

ccgggcttgg taccggaaga cgacggcttt tac 

<210> 12 

<211> 44 

<212> DNA 

<213> Artificial sequence 



33 



<220> 

<223> Primer for 3' end of 75, 77, and 7kDa genes 
<400> 12 

ccgggcacta gtttagtaag cgaacacata attcaaatac acgc 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for internal 75 kDa specific 

<400> 13 
cggggggcaa accaaataca c 

<210> 14 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for internal 75 kDa specific 

<400> 14 
gtgtatttgg tttgcccccc g 

<210> 15 
<211> 36 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for internal 75 kDa seamless cloning 



<400> 15 

gccgaacgtc tcccctaaat gaggcatgcc caaact 



36 



<210> 16 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for internal 7kDa seamless cloning 
<400> 16 

ctacggcgtc tcttagggta gtgataatga tgctcgcttg 40 

<210> 17 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for 5' end start mature 75, 77, and 7kDa genes 
<400> 17 

cgcggcgata tggaagacga cggcttttac 3 0 

<210> 18 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for 3' end of 75, 77, and 7kDa genes 
<400> 18 

ccggccggta ccttagtaag cgaacacata 30 

<210> 19 

<211> 2069 

<212> DNA 

<213> Helicobacter pylori 



<400> 19 



gaagacgacg gcttttacac aagcgtaggc 
aaaaacacca aaggcatcca agagctttca 
aataattaca gcaccctaaa caccctgatc 
gacgcaaggg ataatctagg ctcaagctct 
ccggcgtatc aagcggtgct tttagcactg 
agctatgctt ttactgcttg tggtcctggc 
acttttaata atgtgccagg acaaaatacg 
ccaggacatg gcgggcctat atccactgca 



tatcaaatcg gtgaagccgc tcaaatggtg 
gacaattatg aaaagctgaa caatcttttg 
aaattatccg ctgatccgag cgcaattaac 
aggaatttgc ttgatgtcaa aaccaattcc 
aatgctgcag tggggttgtg gcaagttaca 
agtaacgaga acgcgaatgg aggtatccaa 
acgaccatca cttgcaattc gtattatgag 
aactatgcga aaatcaatca agcctatcaa 



60 
120 
180 
240 
300 
360 
420 
480 
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atcattcaaa aggctttgac agccaatgga gctaatggag atggggtccc tgttttaagc 54 0 

aacaccacta caaaacttga tttcactatc aatggagaca aaagaacggg gggcaaacca 600 

aataaaccat taatatacca atggagtcat gggaaagcta tttcaacctc gtggaatgac 660 

acacaagcaa caccaacaac agaaaacatc aacacagaaa ataacgctca agagctttta 72 0 

aagcaagcga gcatcattat cactacccta aatgaggcat gcccaaactt tcagaatggt 780 

ggtagcggtt attggcaagg gataagtggc aatgggacaa tgtgtgggat gtttaagaat 84 0 

gaaatcagtg ctatccaagg catgatcgct aacgcgcaag aagctgtcgc gcaaagtaaa 900 

atcgttagtg aaaacgcgca aaatcaaaac aacttggaca ctggaaaacc attcacccct 960 

acacagacgc cagcttcgct caaagcatgc tcaaaaacgc tcaagcccaa gcagagattt 1020 

taaaccaagc cgaacaagtg gtgaaaaact ttgaaaaaat ccctagaaat tttgtatcag 1080 

actctttagg ggtgtgttat gaagtgcaag ggggtgagcg taggggcacc aatccaggtc 1140 

aggtaacttc taacacttgg ggggccggtt gcgcctatgt ggggcaaacc ataacaaatc 1200 

taaaaaacag catcgctcac tttggcactc aagagcagca aatacaacga gccgaaaata 1260 

tcgctgacac tctagtgaat ttcaaatcta gatacagcga attggggaat acttacaaca 1320 

gcatcaccac tgcgctctcc aaagtcccta acgcgcaaag cttgcaaaac gtggtgagca 1380 

aaaagaataa cccctatagc ccgcaaggca tagaaaccaa ttactacttg aatcaaaatt 1440 

cttacaacca aatccaaacc atcaaccaag aattagggcg taaccctttt aggaaagtgg 1500 

gcatcgtcgg ctctcaaacc aacaacggcg ccatgaatgg gatcggtatt caggtgggct 1560 

acaagcaatt ctttggccaa aaaagaaaat ggggcgctag gtattacggc ttttttgatt 162 0 

acaaccatgc gtttattaaa tccagcttct tcaactcggc ttctgatgtg tggacttatg 1680 

gttttggagc ggacgctctc tataacttca tcaacgataa agccactaac tttttaggca 1740 

aaaacaacaa gctttctgtg gggctttttg gcgggattgc gttagcgggc acttcatggc 1800 

ttaattctga gtatgtgaat ttagccaccg tgaataatgt ctataacgct aaaatgaacg 1860 

tggcgaactt ccaattctta ttcaacatgg gagtgaggat gaatttggcc aggcccaaga 192 0 

aaaaagacag cgatcatgcg gctcagcatg ggattgagtt agggcttaaa atccccacca 1980 

tcaacaccaa ctactattcc tttatggggg ctgaactcaa atacagaagg ctttatagcg 2040 

tgtatttgaa ttatgtgttc gcttactaa 2069 

<210> 20 
<211> 2156 
<212> DNA 

<213> Helicobacter pylori 
<400> 20 

gaagaccacg gcttttacac aagcgtaggc tatcaaatcg gtaagccgct caaatggtaa 60 

caaacaccaa aggcatccaa gagctttcag acaattatga aaagctgaac aatcttttga 12 0 

atagttacag caccctaaac acccttatca aattgtccgc tgatccgagc gcggtcagtg 180 

gcgcgatcaa caatttgaac gcgggcgcga cgggtttgct caaagaaaaa accaactccc 24 0 

ccgcctatca agccgtctct ttggcgttaa acgccgcagt gggcttgtgg aataccatcg 300 

gctatgcggt catgtgcggg aacgggaacg gcacatcgaa tgggcctggc agcgtgatct 360 

ttaatggaga gccaggacag ggctccacgc agattacttg caaccgcttt gaatcaactg 42 0 

gtcctggtca cagcatgtct attgatgaat tcaagaaact caatcaagcc tatcaaatca 480 

tccaacaagc tttaaaagga aatgggtttc ctgaactagg caacaacggc acaagcgtga 54 0 

gtgttgaata taaatatgaa tgcaaacaga ataataatga tatcgacggc ggtgtgagac 600 

aattctgcca agcgaaaaat ggtagtagca gtggtagtaa tggcagtagt ggtaatagca 66 0 

agcaaacaac cacgcaagac ggcgtaacga tcaccactac taatagtagt aacgaagcca 72 0 

ccgtcaattt tagcatcacc aataacgctg aacagctgtt aaatcaagcg gcaaacatca 780 

tgcaagtcct taacacgcaa tgccctttgg tgcgatctac gaatgatgaa aacgctccag 84 0 

ggggtggcca accatggggt ttaagcacga cagggaatgc gtgccaaatc ttccaacaag 90 0 

aatttagcca ggttactagc atgatcaaaa acgcccaaga aatcgtcgcg caaagccaaa 960 
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tcgctaacgc taaccaaaaa gcagaaataa ctaaccccag taacttcaac cctttcacgg 1020 

acgctggctt tgcacaaaac atgctcaaaa acgctagagc gcaagcagag atgttcaatt 1080 

tagctgagca agtgaaaaag aatttggaag tcatgcaaaa caacaataac gttaatgcgg 114 0 

aattatcagg atttggggat ggaatgacca attttgttag cgcctttttg gcaagctgca 1200 

aaaatgatgg gacattgcct aatcaagggg ttacttctaa cacttggggg gccggttgcg 1260 

cgtatgtggg agagacgata acggctttaa ataacagcat cgctcacttt ggcactcaag 1320 

ccgagcagat agaacaagcc gaaaacatcg ctgacactct agtgcatttc aaatctagat 13 80 

acaatgaatt gggcaacact tataacagca tcaccaccgc gctctccaaa gtccctaacg 1440 

cgcaaagctt gcaaaatgtg atgggaaaaa agaataaccc ctatagccca caaggcatag 1500 

aaaccaatta ctatctcaat caaaactctt acaaccaaat ccaaaccatc aaccaagaat 1560 

taggccgtaa cccctttagg aaagtgggta ttgtcagctc tcaaaccaac aatggcgcca 1620 

tgaatgggat cggtattcag gtgggctata agcaattctt tggccaaaaa agaaaatggg 1680 

gcgctaggta ttacggcttt tttgattaca accatgcgtt cattaaatcc agcttcttca 1740 

actcggcttc tgacgtatgg acttatggtt ttggagtgga cgctctttat aatttcatca 1800 

acgataaagc caccaatttc ttaggcaaaa acaacaagct ttctgtaggg ctttttggcg 1860 

ggattgcgtt agcgggcact tcatggctca attctgaata cgtgaattta gccaccatga 1920 

ataacgtcta taacgctaaa atggatgtgg cgaacttcca gttcttattc aacatgggag 1980 

tgaggatgaa tttagccaga cccaagaaaa aaggcagcga tcatgcggct cagcatggga 2 04 0 

ttgagttagg gcttaaaatc cccaccatca acacgaacta ctattccttt atgggggctg 2100 

aactcaaata cagaaggctc tatagcgtgt atttgaatta tgtgttcgct tactaa 2156 

<210> 21 
<211> 689 
<212> PRT 

<213> Helicobacter pylori 



<400> 21 
Glu Asp Asp Gly 
1 

Ala Gin Met Val 
20 

Tyr Glu Lys Leu 
35 

Leu lie Lys Leu 
50 

Asn Leu Gly Ser 
65 

Pro Ala Tyr Gin 

Trp Gin Val Thr 
100 

Glu Asn Ala Asn 
115 

Asn Thr Thr Thr 
130 

Gly Pro lie Ser 
145 

fie He Gin Lys 
Pro Val Leu Ser 



Phe Tyr Thr Ser 
5 

Lys Asn Thr Lys 

Asn Asn Leu Leu 
40 

Ser Ala Asp Pro 
55 

Ser Ser Arg Asn 
70 

Ala Val Leu Leu 
85 

Ser Tyr Ala Phe 

Gly Gly He Gin 
120 

He Thr Cys Asn 
135 

Thr Ala Asn Tyr 
150 

Ala Leu Thr Ala 
165 

Asn Thr Thr Thr 



Val 


Gly 


Tyr 


Gin 




10 






Gly 


He 


Gin 


Glu 


25 








Asn 


Asn 


Tyr 


Ser 


Ser 


Ala 


He 


Asn 








60 


Leu 


Leu 


Asp 


Val 






75 




Ala 


Leu 


Asn 


Ala 




90 






Thr 


Ala 


Cys 


Gly 


105 








Thr 


Phe 


Asn 


Asn 


Ser 


Tyr 


Tyr 


Glu 








140 


Ala 


Lys 


He 


Asn 






155 




Asn 


Gly 


Ala 


Asn 




170 






Lys 


Leu 


Asp 


Phe 



He Gly Glu Ala 
15 

Leu Ser Asp Asn 
30 

Thr Leu Asn Thr 
45 

Asp Ala Arg Asp 

Lys Thr Asn Ser 
80 

Ala Val Gly Leu 
95 

Pro Gly Ser Asn 
110 

Val Pro Gly Gin 
125 

Pro Gly His Gly 

Gin Ala Tyr Gin 
160 

Gly Asp Gly Val 
175 

Thr He Asn Gly 
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180 



Asp 


Lys 


Arg 


Thr 






195 




Ser 


His 


Gly 


Lys 




210 






Pro 


Thr 


Thr 


Glu 


225 








Lys 


Gin 


Ala 


Ser 


Phe 


Gin 


Asn 


Gly 








260 


Thr 


Met 


Cys 


Gly 






275 




He 


Ala 


Asn 


Ala 




290 






Asn 


Ala 


Gin 


Asn 


305 








Tyr 


Thr 


Asp 


Ala 


Gin 


Ala 


Glu 


He 








340 


Lys 


He 


Pro 


Arg 






355 




Val 


Gin 


Gly 


Gly 




370 






Asn 


Thr 


Asn 


Gly 


385 








Leu 


Lys 


Asn 


Ser 


Arg 


Ala 


Glu 


Asn 








420 


Ser 


Glu 


Leu 


Gly 






435 




Val 


Pro 


Asn 


Ala 




450 






Pro 


Tyr 


Ser 


Pro 


465 








Ser 


Tyr 


Asn 


Gin 


Pne 


Arg 


Lys 


vai 








500 


Asn 


Gly 


He 


Gly 






515 




Arg 


Lys 


Trp 


Gly 




530 






Phe 


He 


Lys 


Ser 


545 








Gly 


Phe 


Gly 


Ala 


Asn 


Phe 


Leu 


Gly 



Gly Gly Lys Pro 
200 

Ala He Ser Thr 
215 

Asn He Asn Thr 
230 

He He He Thr 
245 

Gly Ser Gly Tyr 

Met Phe Lys Asn 
280 

Gin Glu Ala Val 
295 

Gin Asn Asn Leu 
310 

Ser Phe Ala Gin 
325 

Leu Asn Gin Ala 

Asn Phe Val Ser 
360 

Glu Arg Arg Gly 
375 

Ala Gly Cys Ala 
390 

He Ala His Phe 
405 

He Ala Asp Thr 

Asn Thr Tyr Asn 
440 

Gin Ser Leu Gin 
455 

Gin Gly He Glu 
470 

He Gin Thr He 
485 

Gly He Val Gly 

He Gin Val Gly 
520 

Ala Arg Tyr Tyr 
535 

Ser Phe Phe Asn 
550 

Asp Ala Leu Tyr 
565 

Lys Asn Asn Lys 



185 



Asn 


Lys 


Pro 


Leu 


Ser 


Trp 


Asn 


Asp 








220 


Glu 


Asn 


Asn 


Ala 






235 




Thr 


Leu 


Asn 


Glu 




250 






Trp 


Gin 


Gly 


He 


265 








Glu 


lie 


Ser 


Ala 


Ala 


Gin 


Ser 


Lys 








300 


Asp 


Thr 


Gly 


Lys 






315 




Ser 


Met 


Leu 


Lys 




330 






Glu 


Gin 


Val 


Val 


345 








Asp 


Ser 


Leu 


Gly 


Thr 


Asn 


Pro 


Gly 








380 


Tyr 


Val 


Gly 


Gin 






395 




Gly 


Thr 


Gin 


Glu 




410 






Leu 


Val 


Asn 


Phe 


425 








Ser 


He 


Thr 


Thr 


Asn 


Val 


Val 


Ser 








460 


Thr 


Asn 


Tyr 


Tyr 






475 




Asn 


Gin 


Glu 


Leu 




490 






Ser 


Gin 


Thr 


Asn 


505 








Tyr 


Lys 


Gin 


Phe 


Gly 


Phe 


Phe 


Asp 








540 


Ser 


Ala 


Ser 


Asp 






555 




Asn 


Phe 


He 


Asn 




570 






Leu 


Ser 


Val 


Gly 



190 

He Tyr Gin Trp 
205 

Thr Gin Ala Thr 

Gin Glu Leu Leu 
240 

Ala Cys Pro Asn 
255 

Ser Gly Asn Gly 
270 

lie Gin Gly Met 
285 

He Val Ser Glu 

Pro Phe Asn Pro 
320 

Asn Ala Gin Ala 
335 

Lys Asn Phe Glu 
350 

Val Cys Tyr Glu 
365 

Gin Val Thr Ser 

Thr He Thr Asn 
400 

Gin Gin lie Gin 

415 

Lys Ser Arg Tyr 
430 

Ala Leu Ser Lys 
445 

Lys Lys Asn Asn 

Leu Asn Gin Asn 
480 

Gly Arg Asn Pro 
495 

Asn Gly Ala Met 
510 

Phe Gly Gin Lys 
525 

Tyr Asn His Ala 

Val Trp Thr Tyr 
560 

Asp Lys Ala Thr 
575 

Leu Phe Gly Gly 









580 










585 










590 






He 


Ala 


Leu 


Ala 


Gly 


Thr 


Ser 


Trp 


Leu 


Asn 


Ser 


Glu 


Tyr 


Val 


Asn 


Leu 






595 










600 










605 








Ala 


Thr 


Val 


Asn 


Asn 


Val 


Tyr 


Asn 


Ala 


Lys 


Met 


Asn 


Val 


Ala 


Asn 


Phe 




610 










615 










620 










Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly 


Val 


Arg 


Met 


Asn 


Leu 


Ala 


Arg 


Pro 


Lys 


62 5. 










630 










635 










640 


Lys 


Lys 


Asp 


Ser Asp 


His 


Ala 


Ala 


Gin 


His 


Gly 


He 


Glu 


Leu 


Gly 


Leu 










64 5 










650 










655 




Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 


Tyr 


Tyr 


Ser 


Phe 


Met 


Gly Ala 


Glu 








660 










665 










670 






Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 


Val 


Tyr 


Leu 


Asn 


Tyr 


Val 


Phe 


Ala 






675 










680 










685 








Tyr 

































<210> 22 
<211> 713 
<212> PRT 

<213> Helicobacter pylori 



<400> 22 



Glu 


Asp 


Asp 


Gly 


Phe 


Tyr 


Thr 


Ser 


Val 


Gly 


Tyr 


Gin 


He 


Gly 


Glu Ala 


1 








5 










10 










15 


Ala 


Gin 


Met 


Val 


Thr 


Asn 


Thr 


Lys 


Gly 


He 


Gin 


Glu 


Leu 


Ser 


Asp Asn 








20 










25 










30 




Tyr 


Glu 


Lys 


Leu 


Asn 


Asn 


Leu 


Leu 


Asn 


Ser 


Tyr 


Ser 


Thr 


Leu 


Asn Thr 






35 










40 










45 






Leu 


He 


Lys 


Leu 


Ser 


Ala 


Asp 


Pro 


Ser 


Ala 


Val 


Ser 


Gly 


Ala 


He Asn 




50 










55 










60 








Asn 


Leu 


Asn 


Ala 


Gly 


Ala 


Thr 


Gly 


Leu 


Leu 


Lys 


Glu 


Lys 


Thr 


Asn Ser 


65 










70 










75 








80 


Pro 


Ala 


Tyr 


Gin 


Ala 


Val 


Ser 


Leu 


Ala 


Leu 


Asn 


Ala 


Ala 


Val 


Gly Leu 










85 










90 










95 


Trp 


Asn 


Thr 


He 


Gly 


Tyr 


Ala 


Val 


Met 


Cys 


Gly Asn Gly Asn Gly Thr 








100 










105 










110 




Ser 


Asn 


Gly 


Pro 


Gly 


Ser 


Val 


He 


Phe 


Asn 


Gly Glu 


Pro 


Gly 


Gin Gly 






115 










120 










125 






Ser 


Thr 


Gin 


He 


Thr 


Cys 


Asn 


Arg 


Phe 


Glu 


Ser 


Thr 


Gly 


Pro 


Gly His 




130 










135 










140 








Ser 


Met 


Ser 


He 


Asp 


Glu 


Phe 


Lys 


Lys 


Leu 


Asn 


Gin 


Ala 


Tyr 


Gin He 


145 










150 










155 








160 


He 


Gin 


Gin 


Ala 


Leu 


Lys 


Gly 


Asn 


Gly 


Phe 


Pro 


Glu 


Leu 


Gly Asn Asn 










165 










170 










175 


Gly 


Thr 


Ser 


Val 


Ser 


Val 


Glu 


Tyr 


Lys 


Tyr 


Glu 


Cys 


Lys 


Gin 


Asn Asn 








180 










185 










190 




Asn 


Asp 


He 


Asp 


Gly 


Gly Val 


Arg 


Gin 


Phe 


Cys 


Gin 


Ala 


Lys 


Asn Gly 






195 










200 










205 






Ser 


Ser 


Ser 


Gly 


Ser 


Asn 


Gly 


Ser 


Ser 


Gly Asn 


Ser 


Lys 


Gin 


Thr Thr 




210 










215 










220 









Thr Gin Asp Gly Val Thr He Thr Thr Thr Asn Ser Ser Asn Glu Ala 
225 230 235 240 

Thr Val Asn Phe Ser He Thr Asn Asn Ala Glu Gin Leu Leu Asn Gin 

245 250 255 

Ala Ala Asn He Met Gin Val Leu Asn Thr Gin Cys Pro Leu Val Arg 

260 265 270 

Ser Thr Asn Asp Glu Asn Ala Pro Gly Gly Gly Gin Pro Trp Gly Leu 

275 280 285 

Ser Thr Thr Gly Asn Ala Cys Gin He Phe Gin Gin Glu Phe Ser Gin 

290 295 300 

Val Thr Ser Met He Lys Asn Ala Gin Glu He Val Ala Ser Gin He 
305 310 315 320 

Ala Asn Ala Asn Gin Lys Ala Glu He Thr Asn Pro Ser Asn Phe Asn 

325 330 335 

Pro Phe Thr Asp Ala Gly Phe Ala Gin Asn Met Leu Lys Asn Ala Arg 

340 345 350 

Ala Gin Ala Glu Met Phe Asn Leu Ala Glu Gin Val Lys Lys Asn Leu 

355 360 365 

Glu Val Met Gin Asn Asn Asn Asn Val Asn Ala Glu Leu Ser Gly Phe 

370 375 380 

Gly Asp Gly Met Thr Asn Phe Val Ser Ala Phe Leu Ala Ser Cys Lys 
385 390 395 400 

Asn Asp Gly Thr Leu Pro Asn Gin Gly Val Thr Ser Asn Thr Trp Gly 

405 410 415 

Ala Gly Cys Ala Tyr Val Gly Glu Thr He Thr Ala Leu Asn Asn Ser 

420 425 430 

He Ala His Phe Gly Thr Gin Ala Glu Gin He Glu Gin Ala Glu Asn 

435 440 445 

He Ala Asp Thr Leu Val His Phe Lys Ser Arg Tyr Asn Glu Leu Gly 

450 455 460 

Asn Thr Tyr Asn Ser He Thr Thr Ala Leu Ser Lys Val Pro Asn Ala 
465 470 475 480 

Gin Ser Leu Asn Val Met Gly Lys Lys Asn Asn Pro Tyr Ser Pro Gly 

485 490 495 

He Glu Thr Asn Tyr Tyr Leu Asn Asn Ser Tyr Asn Gin He Gin Thr 

500 505 510 

He Asn Gin Glu Leu Gly Arg Asn Pro Phe Arg Lys Val Gly He Val 

515 520 525 

Ser Ser Gin Thr Asn Asn Gly Ala Met Asn Gly He Gly He Gin Val 

530 535 540 

Gly Tyr Lys Gin Phe Phe Gly Gin Lys Arg Lys Trp Gly Ala Arg Tyr 
545 550 555 560 

Tyr Gly Phe Phe Asp Tyr Asn His Ala Phe He Lys Ser Ser Phe Phe 

565 570 575 

Asn Ser Ala Ser Asp Val Trp Thr Tyr Gly Phe Gly Val Asp Ala Leu 

580 585 590 

Tyr Asn Phe He Asn Asp Lys Ala Thr Asn Phe Leu Gly Lys Asn Asn 

595 600 605 

Lys Leu Ser Val Gly Leu Phe Gly Gly He Ala Leu Ala Gly Thr Ser 
610 615 620 



Trp 


Leu 


Asn 


Ser 


Glu 


Tyr 


Val 


Asn 


Leu 


Ala 


Thr 


Met 


Asn 


Asn 


Val 


Tyr 


625 










630 










635 










640 


Asn 


Ala 


Lys 


Met 


Asp 


Val 


Ala 


Asn 


Gin 


Phe 


Leu 


Phe 


Asn 


Met 


Gly Val 










645 










650 










655 




Arg 


Met 


Asn 


Leu 


Ala 


Arg 


Pro 


Lys 


Lys 


Lys 


Gly 


Ser 


Asp 


His 


Ala 


Ala 








660 










665 










670 






Gin 


His 


Gly 


He 


Glu 


Leu 


Gly 


Leu 


Lys 


He 


Pro 


Thr 


He 


Asn 


Thr 


Asn 






675 










680 










685 








Tyr 


Tyr 


Ser 


Phe 


Met 


Cys 


Ala 


Glu 


Leu 


Lys 


Tyr 


Arg 


Arg 


Leu 


Tyr 


Ser 




690 










695 










700 










Val 


Tyr 


Leu 


Asn 


Tyr 


Val 


Phe 


Ala 


Tyr 
















705 










710 























<210> 23 
<211> 20 
<212> PRT 

<213> Helicobacter pylori 
<400> 23 

Cys Ala Ser Gly Asn Thr Ser His Val He Thr Asn Lys Leu Asp Gly 

15 10 15 

Val Pro Asp Ser 
20 

<210> 24 
<211> 18 
<212> PRT 

<213> Helicobacter pylori 
<400> 24 

Cys Ser Pro Ser Val Asn Gly Thr Lys Thr Thr Thr Gin Thr He Asp 

1 5 10 15 

Gly Lys 



<210> 25 
<211> 12 
<212> PRT 

<213> Helicobacter pylori 
<400> 25 

Cys Tyr Pro Glu Asn Leu Ser Glu Asn Phe Lys Asn 
15 10 

<210> 26 

<211> 12 

<212> PRT 

<213> Helicobacter pylori 



<400> 26 



Cys Asn Pro Glu Asn Leu Ser Glu Asn Phe Lys Asn 
15 10 



<210> 27 
<211> 8 
<212> PRT 

<213> Helicobacter pylori 
<400> 27 

Cys Ser Gly Gin Gly Asn Asn Asn 
1 5 

<210> 28 
<211> 22 
<212> PRT 

<213> Helicobacter pylori 
<400> 28 

Cys Val Met Lys Asn Asn Asn Asn Val Asn Glu Lys Leu Ala Gly Phe 

15 10 15 

Gly Lys Glu Glu Val Met 
20 

<210> 29 
<211> 18 
<212> PRT 

<213> Helicobacter pylori 
<400> 29 

Cys Lys Ala Lys Asn Gly Ser Ser Ser Ser Ser Asn Gly Gly Asn Gly 

1 5 10 15 

Ser Ser 



<210> 30 
<211> 16 
<212> PRT 

<213> Helicobacter pylori 
<400> 30 

Cys Leu Val Arg Ser Thr Asn Asn Glu Asn Thr Pro Gly Gly Gly Gin 
15 10 15 

<210> 31 

<211> 16 

<212> PRT 

<213> Helicobacter pylori 



<400> 31 

Cys Leu Val Arg Ser Thr Asn Asn Glu Asn Thr Pro Gly Gly Gly Gin 



1 



5 



10 



15 



<210> 32 
<211> 14 
<212> PRT 

<213> Helicobacter pylori 
<400> 32 

Cys Arg Gin Thr Ala Asp lie Asn Gly Gly Val Tyr Gin Phe 
15 10 

<210> 33 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide used to align to the H. pylori genome 
<220> 

<221> Unsure 

<222> 12, 21, 24, 27, 30, 42, 48, 51, and 60 
<223> n is a or g or c or t/u 

<400> 33 

gargaygayg gnttytayac nwsngtnggn taycarathg gngargcngc ncaratggtn 60 





<210> 


34 








<211> 


20 








<212> 


PRT 








<213> 


Helicobacter pylori 








<400> 


34 






Glu 


Asp Asp 


Gly Phe Tyr Thr Ser Val 


Gly Tyr Gin 


He Gly Glu 


1 




5 


10 


15 


Ala 


Gin Met 


Val 










20 








<210> 


35 








<211> 


21 








<212> 


PRT 








<213> 


Helicobacter pylori 







<400> 35 

Glu Asp Asp Gly Phe Tyr Thr Ser Val Gly Tyr Gin He Gly Glu Ala 

15 10 15 

Ala Gin Met Val Lys 
20 



<210> 36 



<211> 15 
<212> PRT 

<213> Helicobacter pylori 
<400> 36 

Ser Thr Ser Ser Thr Thr lie Phe Asn Asn Glu Pro Gly Tyr Arg 
1 5 10 15 . 

<210> 37 

<211> 14 

<212> PRT 

<213> Helicobacter pylori 

<400> 37 

Thr Gly Gly Lys Pro Asn Lys Pro Leu lie Tyr Gin Trp Ser 
1 5 10 

<210> 38 
<211> 9 
<212> PRT 

<213> Helicobacter pylori 
<400> 38 

Thr Thr Thr Gin Thr lie Asp Gly Lys 
1 5 

<210> 39 
<211> 13 
<212> PRT 

<213> Helicobacter pylori 
<400> 39 

Asn Ser He Ala His Phe Gly Thr Gin Glu Gin Gin He 
1 5 10 

<210> 40 
<211> 13 
<212> PRT 

<213> Helicobacter pylori 
<400> 40 

Val Pro Asn Ala Gin Ser Leu Gin Asn Val Val Ser Lys 
15 10 

<210> 41 
<211> 14 
<212> PRT 

<213> Helicobacter pylori 



Ser Lys Lys Asn Asn Pro Tyr Ser Pro Gin Gly lie Glu Thr 
15 10 



<210> 42 
<211> 7 
<212> PRT 

<213> Helicobacter pylori 



<400> 42 
Asn Tyr Tyr Leu Asn Gin Asn 
1 5 



